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ADENO-ASSOCIATED VIRUSES AND USES THEREOF 
Statement of O overnmpn f Rjghf S 
This invention was made at least in part with a grant from the 
Government of the United States of America (grant HL51887 from the National 
5 Institutes of Health). The Government may have certain rights in the invention. 



Background of the Inventio n 

Adeno-associated virus (AAV) is a non-pathogenic parvovirus with a 
single-stranded. DNA genome of 4680 nucleotides. The genome may be of either 
10 plus or minus polarity, and codes for two groups of genes, Rep and Cap (Bems 
et al., 1990). Inverted terminal repeats (ITRs), characterized by palindromic 
sequences producing a high degree of secondary structure, are present at both 
ends of the viral genome. While other members of the parvovirus group 
replicate autonomously, AAV requires co-infection with a helper virus (i.e., 
15 adenovirus or herpes virus) for lytic phase productive replication. In the absence 
of a helper virus, wild-type AAV (wtAAV) establishes a latent, non-productive 
infection with long-term persistence by integrating into a specific locus on 
chromosome 1 9, AAVS 1 , of the host genome through a Rep-facilitated 
mechanism (Samulski, 1993; Linden et al, 1996; Kotin et al., 1992). 
20 In contrast to wtAAV, the mechanism(s) of latent phase persistence of 

recombinant AAV (rAAV) is less clear. rAAV integration into the host genome 
is not site-specific due to deletion of the AAV Rep gene (Ponnazhagan et al., 
1997). Analysis of integrated proviral structures of both wild type and 
recombinant AAV have demonstrated head-to-tail genomes as the predominant 
25 structural forms. 

rAAV has recently been recognized as an extremely attractive vehicle for 
gene delivery (Muzyczka, 1992). rAAV vectors have been developed by 
substituting all viral open reading frames with a therapeutic minigene, while 
retaining the cis elements contained in two inverted terminal repeats (ITRs) 
30 (Samulski et al., 1987; Samulski et al., 1989). Following transduction, rAAV 
genomes can persist as cpisomes (Flotte et al., 1994; Afione et al., 1996; Duan et 
al. t 1998), or alternatively can integrate randomly into the cellular genome 
(Bems et al., 1996; McLaughlin et al., 1988; Duan et al., 1997; Fisher-Adams et 
ai.; 1996; Kearns et al., 1996; Ponnazhagan et al., 1997). However, little is 
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known about the mechanisms enabling rAAV vectors to persist in vivo or the 
identity of cellular factors which may modulate the efficiency of transduction 
and persistence. Although transduction of rAAV has been demonstrated in vitro 
in cell culture (Muzyczka, 1992) and in vivo in various organs (Kaplitt et al., 
5 1994; Walsh et al., 1994; Conrad et al., 1996; Herzog et al., 1997; Snyder et al., 
1997), the mechanisms of rAAV-mediated transduction remain unclear. 

Moreover, while rAAV has been shown to be capable of stable, long- 
term transgene expression both in vitro and in vivo in a variety of tissues, the 
transduction efficiency of rAAV is markedly variable in different cell types. For 
1 0 example, rAAV has been reported to transduce lung epithelial cells at low levels 
(Halbert et al., 1997; Duan et al., 1 998a), while high level, persistent transgene 
expression has been demonstrated in muscle, neurons and in other non-dividing 
cells (Kessler et al., 1996; Fisher et al., 1997; Herzog et al., 1997; Xiao et al., 
1996; Kaplitt et al., 1994; Wu et al., 1998; Ali et al., 1996; Bennett et al., 1997 
15 Westfall et al., 1997). These tissue-specific differences in rAAV mediated gene 
transfer may, in part, be due to variable levels of cellular factors affecting AAV 
infectivity (i.e., receptors and co-receptors such as heparin sulfate proteoglycan, 
FGFR-1, and aVp5 integrin) (Summerford et al., 1998; Qing et al., 1999; 
Summerford et al., 1999) as well as the latent life cycle (i.e., nuclear trafficking 
20 of virus and/or the conversion of single stranded genomes to expressible forms) 
(Qing et al, 1997; Qing et al., 1998). 

Muscle-mediated gene transfer represents a very promising approach for 
the treatment of hereditary myopathies and several other metabolic disorders. 
Previous studies have demonstrated remarkably efficient and persistent transgene 
25 expression to skeletal muscle in vivo with rAAV vectors. Applications in this 
model system include the treatment of several inherited disorders such as Factor 
IX deficiency in hemophilia B and epo deficiencies (Kessler et al., 1 996; Herzog 
et al., 1997). Although the conversion of low-molecular- weight rAAV genomes 
to high-molecular-weight concatamers has been inferred as evidence for 
30 integration of proviral DNA in the host genome, no direct evidence exists in this 
regard (Xiao et al., 1996; Clark et al., 1997; Fisher et al. 1997). Also, the 
molecular processes and/or structures associated with episomal long-term 
persistence of rAAV genomes, e.g., in nondividing mature myofibers, remains 
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unclear. 

In addition, due to limitations in rAAV vector packaging capacity, a 
rAAV vector may not be useful if large regulatory elements are needed to control 
transgene expression. 
5 Thus, there is a need for rAAV vectors that have increased stability 

and/or persistence in host cells. Moreover, there is a need for vectors useful to 
express large open reading frames. 

Summary of the Invention 

10 The present invention provides a recombinant adeno-associated virus 

(rAAV) vector comprising a nucleic acid segment formed by the juxtaposition of 
sequences in the AAV inverted terminal repeats (ITRs) which are present in a 
circular intermediate of AAV. The circular intermediate was isolated from 
rAAV-infected cells by employing a recombinant AAV "shuttle" vector. The 

15 shuttle vector comprises: a) a bacterial origin of replication; b) a marker gene or 
a selectable gene; c) a 5' ITR; and d) a 3' 1TR. Preferably, the recombinant 
AAV shuttle vector contains a reporter gene, e.g., a GFP, alkaline phosphatase or 
P-galactosidase gene, a selectable marker gene, e.g., an ampicillin-resistance 
gene, a bacterial origin of replication, a 5' ITR and a 3' ITR. The vector is 

20 contacted with eukaryotic cells so as to yield transformed eukaryotic cells. Low 
molecular weight DNA ("Hirt DNA") from the transformed eukaryotic cells is 
isolated. Bacterial cells are contacted with the Hirt DNA so as to yield 
transformed bacterial cells. Then bacterial cells are identified which express the 
marker or selectable gene present in the shuttle vector and which comprise at 

25 least a portion of a circular intermediate of adeno-associated virus. Also, as 
described below, it was found that circularized intermediates of rAAV impart 
episomal persistence to linked sequences in Hela cells, fibroblasts and muscle 
cells. In HeLa cells, the incorporation of certain AAV sequences, e.g., ITRs, 
from circular intermediates into a heterologous plasmid conferred a 10-fold 

30 increase in the stability of plasmid-based vectors in HeLa cells. Unique features 
of these transduction intermediates included the in vivo circularization of a 
head-to-tail monomer as well as multimer (concatamers) episomal viral genomes 
with associated specific base pair alterations in the 5' viral D-sequence. The 

3 
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majority of circular intermediates had a consistent head-to-tail configuration 
consisting of monomer genomes (<3 kb) which slowly converted to large 
multimers of >12 kb by 80 days post-infection in muscle. Importantly, 
long-term transgene expression was associated with prolonged (80 day) episomal 
5 persistence of these circular intermediates. Thus, in vivo persistence of rAAV 
can occur through episomal circularized genomes which may represent 
prointegration intermediates with increased episomal stability. Moreover, as 
described below, co-infection with adenovirus, at high multiplicities of infection 
(MOI) capable of producing early adenoviral gene products, led to increases in 
10 the abundance and stability of AAV circular intermediates which correlated with 
an elevation in transgene expression from rAAV vectors. Thus, these results 
demonstrate the existence of a molecular structure involved in AAV transduction 
which may play a role in episomal persistence and/or integration. 

Further, these results may aid in the development of non- viral or viral- 
1 5 based gene delivery systems having increased efficiency. For example, 
therapeutic or prophylactic therapies in which the present vectors are useful 
include blood disorders (e.g., sickle cell anemia, thalassemias, hemophilias, and 
Fanconi anemias), neurological disorders, such as Alzheimer's disease and 
Parkinson's disease, and muscle disorders involving skeletal, cardiac or smooth 
20 muscle. In particular, therapeutic genes useful in the vectors of the invention 
include the P-globin gene, the y-globin gene, the cystic fibrosis transmembrane 
conductance receptor gene (CFTR), the erythropoietin (epo) gene, the Fanconi 
anemia complementation group, a gene encoding a ribozyme, an antisense gene, 
a low density lipoprotein (LDL) gene, a tyrosine hydroxylase gene (Parkinson's 
25 disease), a glucocerebrosidase gene (Gaucher's disease), an arylsulfatase A gene 
(metachromatic leukodystrophies) or genes encoding other polypeptides or 
proteins. Also within the scope of the invention is the inclusion of more than 
one gene in a vector of the invention, i.e., a plurality of genes may be present in 
an individual vector. Further, as a circular intermediate may be a concatamer, 
30 each monomer of that concatamer may comprise a different gene, or a portion 
thereof. 

For viral-based delivery systems, helper- free virus can be prepared (see 
WO 95/13365) from circular intermediates or vectors of the invention. 
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Alternatively, liposomes, plasmid or virosomes may be employed to deliver a 
vector of the invention to a host or host cell. 

The increased persistence of circular intermediates or vectors having one 
or a plurality of ITRs may be due to the primary and/or secondary structure of 
5 the ITRs. The primary structure of a consensus sequence .(SEQ ID NO:3) of 
ITRs formed by the juxtaposition and physical (phosphodi ester bond) linkage of 
ITRs from AAV is shown in Figure 2C. However, as described hereinbelow, 
each ITR sequence may be incomplete, i.e., the ITR may be a subunit or portion 
of the full length ITRs present in the consensus sequence. Moreover, preferably, 
1 0 an isolated DNA segment of the invention is not the 1 65 bp double DD sequence 
(SEQ ID NO:7) disclosed in U.S. Patent No. 5,478,745, referred to as a "double 
sequence". 

Moreover, the formation, persistence and/or abundance of molecules 
having the ITR sequences of the invention may be modulated by helper virus, 
15 e.g., adenoviral proteins and/or host cell proteins. Thus, the circular 

intermediates or vectors of the invention may be useful to identify and/or isolate 
proteins that bind to the ITR sequences present in those molecules. 

Therefore, the present invention provides an isolated and purified DNA 
molecule comprising at least one DNA segment, a biologically active subunit or 

20 variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stability, persistence or abundance of the 
isolated DNA molecule in a host cell. Preferably, the DNA molecule comprises 
at least a portion of a left (5') inverted terminal repeat (ITR) of adeno-associated 
virus. Also preferably, the DNA molecule comprises at least a portion of a right 

25 (3')-inverted terminal repeat of adeno-associated virus. The invention also 
provides a gene transfer vector, comprising: at least one first DNA segment, a 
biologically active subunit or variant thereof, of a circular intermediate of adeno- 
associated virus, which DNA segment confers increased episomal stability or 
persistence of the vector in a host cell; and a second DNA segment comprising a 

30 gene. Preferably, the second DNA segment encodes a therapeutically effective 
polypeptide. The first DNA segment comprises ITR sequences, preferably at 
least about 100, more preferably at least about 300, and even more preferably at 
least about 400, bp of adeno-associated virus sequence. A preferred vector of the 
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invention is a plasmid. 

Thus, the vector of the invention is useful in a method of delivering 
and/or expressing a gene in a host cell, to prepare host cells having the vector(s), 
and in the preparation of compositions comprising such vectors. To deliver the 
5 gene to the host cell, a recombinant adenovirus helper virus may be employed. 

The implications of intermolecular recombination of rAAV genomes to 
form a single circular episome, which may be a circular concatamer comprising 
at least two different rAAV genomes, is particularly relevant for gene therapy 
with rAAV. First, large regulatory elements and genes beyond the packaging 
1 0 capacity of rAAV can be brought together by co-infecting tissue with two 
independent vectors. For example, enhancers and/or promoters may be 
introduced into one vector while DNA comprising an open reading frame, e.g., a 
gene of interest, with or without a minimal promoter, is introduced into a second 
vector. Thus, after co-infection with the two vectors, the transgene cassette size 
1 5 is increased beyond that for a single AAV vector alone and the DNA comprising 
the opening reading frame is linked to the enhancer and/or promoter. In another 
embodiment, of the invention, vectors encoding two independent regions of a 
gene are brought together to form an intact splicing unit by circular 
concatamerization. In a further embodiment of the invention, a vector 
20 comprising an origin of replication and a DNA encoding a protein that binds to 
the origin and promotes replication and/or maintenance of DNA that is linked to 
the origin, and a vector comprising a gene of interest are brought together after 
co-infection to form an autonomously replicating episome comprising the gene. 
As described hereinbelow, the tibialis muscle of mice was co-infected 
25 with rAAV Alkaline phosphatase (Alkphos) and GFP encoding vectors. The 
GFP shuttle vector also encoded ampicillin resistance and a bacterial origin of 
replication to allow for bacterial rescue of circular intermediates in Hirt DNA 
from infected muscle samples. There was a time dependent increase in the 
abundance of rescued plasmids encoding both GFP and Alkphos that reached 
30 33% of the total circular intermediates by 120 days post-infection. Furthermore, 
these large circular concatamers were capable of expressing both GFP and 
Alkphos encoded transgenes following transient transfection in cell lines. Thus, 
concatamerization of AAV genomes in vivo occurs through intermolecular 
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third DNA segment comprising at least a portion of an open reading frame which 
together with the second DNA segment of the first vector encodes a full-length 
polypeptide; and a fourth DNA segment comprising a 3' ITR. Preferably, the 
second DNA segment of the first vector comprises a first exon of a gene 
5 comprising more than one exon and the third DNA segment of the second vector 
comprises at least one exon of a gene that is not the first exon. 

The invention also provides a method to transfer and express a 
polypeptide in a host cell. The method comprises contacting the host cell with at 
least two rAAV vectors. One rAAV vector comprises a first DNA segment 
10 comprising a 5 'ITR linked to a second DNA segment comprising a promoter 
operably linked to a third DNA segment comprising a first open reading frame 
linked to a fourth DNA segment comprising a 3' ITR. A second rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment 
15 comprising a second open reading frame linked to a fourth DNA segment 
comprising a 3 ITR. Alternatively, one rAAV vector comprises a first DNA 
segment comprising a 5 'ITR linked to a second DNA segment comprising a 
promoter operably linked to a third DNA segment comprising the 5' end of an 
open reading frame linked to fourth DNA segment comprising a 5' splice site 
20 linked to a fifth DNA segment comprising a 3 ' ITR. The second rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a 3' splice site linked to a third DNA segment comprising 
the 3' end of the open reading frame linked to a fourth DNA segment comprising 
a 3 TTR. The host cell is preferably contacted with both of the vectors, 
25 concurrently, although it is envisioned that the host cell may be contacted with 
each vector at a different time relative to the contact with the other vectors). 

Also provided is a method in which the composition of the invention is 
administered to the cells or tissues of an animal. For example, rAAV vectors 
have shown promise in transferring the CFTR gene into airway epithelial cells of 
30 animal models and nasal sinus of CF patients. However, high level expression 
of CFTR has not been achieved due to the fact that AAV cannot accommodate 
the full-length CFTR gene together with a potent promoter. A number of studies 
have tried to optimize rAAV-mediated CFTR expression by utilizing truncated 

8 



W0 01/25465 PCTAJS00/27863 

or partially deleted CFTR genes together with stronger promoters. However, it 
is currently unknown what effect deletions within the CFTR gene may have on 
complementation of bacterial colonization defects in the CF airway. Therefore, 
the present invention includes the administration to an animal of a composition 
5 of the invention comprising at least two rAAV vectors which together encode 
CFTR. The present invention is useful to overcome the current size limitation 
for transgenes within rAAV vectors, and allows for the incorporation of a larger 
transcriptional regulatory region, e.g., a stronger heterologous promoter or the 
endogenous CFTR promoter. 
10 As described hereinbelow, transgene expression from rAAV luciferase 

vectors, with or without a promoter, can be greatly enhanced by co-infection 
with an independent rAAV vector carrying the cytomegalovirus (CMV) and 
simian virus 40 (SV40) enhancers. Thus, co-infection with a transgene 
containing vector and a second vector comprising at least one, preferably at least 
15 two or more, enhancer sequences, of cell lines and muscle in vivo resulted in a 
greater than 600- fold enhancement of transgene expression from a minimal 
S V40 promoter. Furthermore, 200- fold enhancement was also achieved by cis- 
activation of ITRs in transgene containing vectors without a promoter. Thus, 
large regulatory elements including tissue specific enhancers can be introduced 
20 into cells by a separate rAAV to regulate the expression of a second transgene 
containing vector in cis following intracellular concatamerization. 

Thus, the invention provides a composition comprising at least two 
recombinant AAV genomes. The composition comprises a first recombinant 
AAV comprising a first recombinant DNA molecule comprising linked: i) a first 
25 DNA segment comprising a 5 '-inverted terminal repeat of AAV; ii) a second 
DNA segment which does not comprise AAV sequences; and iii) a third DNA 
segment comprising a 3'-inverted terminal repeat of AAV; and comprises a 
second recombinant AAV comprising a second recombinant DNA molecule 
comprising linked: i) a first DNA segment comprising a 5 '-inverted terminal 
30 repeat of AAV; ii) a second DNA segment which does not comprise AAV 
sequences and which second DNA segment is different than the second DNA 
segment of the first recombinant DNA molecule; and iii) a third DNA segment 
comprising a 3 '-inverted terminal repeat of AAV. The composition of the 
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invention, is preferably contacted with a mammalian host cell, e.g., a murine, 
canine, feral or human cell. Alternatively, a host cell may be contacted with 
each recombinant AAV individually, e.g., sequentially. 

Thus, in one embodiment of the invention, a host cell is contacted with at 
5 least two recombinant AAV genomes. A first recombinant AAV comprises a 
first recombinant DNA molecule comprising linked:i ) a first DNA segment 
comprising a 5 '-inverted terminal repeat of AAV; ii) a second DNA segment 
which does not comprise AAV sequences; and iii) a third DNA segment 
comprising a 3 '-inverted terminal repeat of AAV. A second recombinant AAV 

10 comprises a second recombinant DNA molecule comprising linked: i) a first 
DNA segment comprising a 5 '-inverted terminal repeat of AAV; ) a second 
DNA segment which does not comprise AAV sequences and which second DNA 
segment is different than the second DNA segment of the first recombinant DNA 
molecule; and iii) a third DNA segment comprising a 3 '-inverted terminal repeat 

15 of AAV. 

In one embodiment of the invention, the second DNA segment of the first 
recombinant DNA molecule comprises a portion of an open reading frame, e.g., 
an exon of a multi-exon gene, operably linked to a promoter. For example, the 
promoter may be the endogenous promoter for the gene corresponding to the 

20 open reading frame. Preferably, the second DNA segment of the second 

recombinant DNA molecule comprises the remainder of the open reading frame 
which together with the second DNA segment of the first recombinant DNA 
molecule encodes a full-length polypeptide. Also preferably, the first 
recombinant DNA molecule comprises a splice donor site 3' to the open reading 

25 frame, and the second DNA segment of the second recombinant DNA molecule 
comprises a splice acceptor site 5' to the remainder of the open reading frame. 

In another embodiment of the invention, the second DNA segment of the 
first recombinant DNA molecule comprises at least one heterologous enhancer 
and/or at least one heterologous promoter, i.e., the enhancer and/or promoter 

30 sequences are not derived from AAV sequences. Preferably, the second DNA 
segment of the second recombinant DNA molecule comprises at least a portion 
of an open reading frame. 
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In yet a farther embodiment of the invention, the second DNA segment 
of the first recombinant DNA molecule comprises an origin of replication 
functional in a host cell, e.g., a viral origin of replication such as OriP. 
Preferably, the origin is functional in a human cell. Also preferably, the second 
5 DNA segment of the first recombinant DNA molecule further comprises DNA 
encoding a protein that binds to the origin of replication, e.g., EBNA-1 . The 
second DNA segment in the second recombinant DNA molecule comprises at 
least a portion of an open reading frame, and preferably a promoter operably 
linked to the open reading frame. 

10 

In yet another embodiment of the invention, the second DNA segment of 
the first recombinant DNA molecule comprises a as-acting integration 
sequence(s) for a recombinase and also encodes a recombinase or integrase that 
is specific for the integration sequence(s), e.g., Cre/lox system of bacteriophage 

15 PI (U.S. Patent No. 5,658,772), the FLP/FRT system of yeast, the Gin 

recombinase of phage Mu, the Pin recombinase of E. coli, the R/RS system of 
the pSRl plasmid, a retrotransposase or the integrase from a lentivirus or 
retrovirus. The second DNA segment in the second recombinant DNA molecule 
comprises at least a portion of an open reading frame, and preferably a promoter 

20 operably linked to the open reading frame. The formation of a concatamer 
comprising the first and the second recombinant DNA molecules, and the 
expression of the recombinase or integrase, will enhance the integration of the 
concatamer, or a portion thereof, into the host genome. 



25 Brief Description of the Fig ures 

Figure 1 . Structure of pro viral shuttle vector and the predicted structure 
of rAAV circular intermediate monomers. With the aid of a rAAV m-acting 
plasmid, pCisAV.GFP3ori (Panel A), AV.GFP3ori recombinant virus was 
produced (Panel B). This vector encoded a GFP transgene cassette, an 

30 ampicillin resistance gene (amp), and a bacterial replication origin (ori). The 
predominant form of circular intermediates isolated following transduction of 
Hela cells with AV.GFP3ori consisted of head-to-tail monomers (Panels C and 
D). 
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expression in the presence of increasing amounts (0, 500, 5000 and 10000 
particles/cell) of recombinant Ad.CMVlacZ was quantified by FACS analysis at 
72 hour post-infection (Panel C). Results demonstrate the mean (+/-SEM) for 
two experiments performed in duplicate. In addition, an aliquot of cells was split 
5 (1 : 10) at the time of FACS analysis and GFP colony forming units (CFU) per 
10X field were quantified at 6 days (CPE denotes significant cytopathic effects 
at an adenoviral MOI of 10,000 particles/cell and was not quantified for GFP 
colonies). Hirt DNAs from A V.GFP3ori (1000 DNA particles/cell) infected 
Hela cells with or without co-infection with Ad.CMVlacZ (5,000 particles/cell) 

10 were used to transform E. coli. The total number of ampicillin-resistant bacterial 
CFU (Panel D) and total number of head- to-tail circular intermediates CFU 
(Panel E) are given for a representative experiment. Greater than 20 clones for 
each time point were evaluated by Southern blot (see Figure 2 for detail). Zero 
hour controls were performed by mixing an equivalent amount of AV.GFP3ori 

15 virus as used in experiments with mock infected cellular lysates prior to Hirt 
purification. Panel F depicts the abundance of head-to-tail circular intermediates 
as a percentage of total ampicillin-resistant bacterial CFU isolated from Hirt 
DNA. 

Figure 4. Formation of rAAV head-to-tail circular intermediates 
20 following in vivo transduction of muscle. The tibialis anterior muscle of 4-5 
week old C57BL/6 mice were infected with AV.GFP3ori (3 X 1010 particles) in 
HEPES buffered saline (30 jil). GFP expression (Panel A) was analyzed by 
direct immunofluorescence of freshly excised tissues and/or in formalin-fixed 
cryopreserved tissue sections in four independently injected muscles harvested at 
25 0, 5, 10, 16, 22 and 80 days post-infection. GFP expression was detected at low 
levels beginning at 10 days and was maximum at 22 days post-infection. 
Expression remained stable to 80 days at which time greater than 50% of the 
tissue was positive (see 80 day tissue cross section counter stained with 
propidium iodide, panel A). Hirt DNA was isolated from muscle samples at 
30 each of the various time points and after points was used to transform E. coli. 
Rescued plasmids (p439, pi 6, pi 7) were analyzed by Southern blotting in Panel 
B showing an agarose gel on left and ITR probed blot on right. U:uncut, P:PstI 
cut, and S:SphI cut. The schematic drawing of the most predominant type of 
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head-to-tail circular AAV intermediate plasmids rescued from bacteria is given 
in the right of Panel B and shows the structure of pi 7 as an example. Other 
typical clones included those with less than two ITRs as shown for pi 6. SphI 
digestion of p 16 and pi 7 plasmids released ITR hybridizing fragments of 
5 approximately 140 and 300 bp, respectively. The slightly lower mobility then 
predicted for these ITR fragments likely represents anomalous migration due to 
the high secondary structure of inverted repeats within ITRs. Sequence analysis 
of pl7 and pl6 using nested primers to 5' and 3'-ITRs also confirmed the ITR 
orientations shown to the right of the gel. Additional restriction enzyme 
10 analyses to determine this structure included double and single digests with 

SphI, PstI, Asel, and/or Smal. An example of an atypical clone (p439) rescued 
from bacteria with unknown structure is also shown. 

Figure 5. Frequency of circular intermediate formation in muscle 
following transduction with rAAV. Hirt DNAs isolated from rAAV infected 
15 tibialis muscle were used to transform E. coli and the rescued plasmids analyzed 
by Southern blotting (greater than 20 clones were analyzed from at least two 
independent muscle samples for each time point). The averages of total 
head-to-tail circular intermediate clones (line) and ampicillin resistant bacterial 
clones (bar) isolated from each tibialis anterior muscle at 0, 5, 10, 16, 22 and 80 
20 days post-infection are summarized in Panel A. Only plasmids which contained 
1-2 ITRs were included in the estimation of total head-to-tail circular 
intermediates. Plasmids which demonstrated an absence of ITR hybridizing 
SphI fragments (between 150 to 300 bp) were omitted from the calculations. 
Panel B demonstrates the diversity of ITR arrays found in head-to-tail circular 
25 intermediates at 80 days post-infection. This panel depicts a Southern blot 

probed with ITR sequences and represents circular intermediates with 1-3 ITRs. 
SphI fragments which hybridize to ITR probes indicate the size of inverted ITR 
arrays (marked by arrows to right of gel). Additional restriction enzyme analysis 
was used to determine the structure of monomer and multimer circular 
30 intermediates. Examples are shown for two multimer (pl36 and p!43) circular 
intermediates which contain approximately three AAV genomes. Undigested 
plasmids of pl36 and pl43 migrate greater than 12 kb and is contrasted to the 
most predominant form of head-to-tail undigested circular intermediates at 22 
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days which migrate at 2.5 kb. The digestion pattern of pl36 is consistent with a 
uniform head-to-tail configuration of three genomes which is indistinguishable 
from digestion patterns of P 139 which contains one circularized genome 
(undigested pl39 migrates at 2.5 kb, data not shown, also see examples P 17 in 
5 Figure 4). In contrast, pi 36 depicts a more complex head-to-tail multimer 
circular intermediate which has various deletions and duplications within the 
ITR arrays. Predicted structure of five representative intermediates is 
schematically shown in Panel C. 

Figure 6. Molecular size of circular intermediates in muscle. HirtDNA 
1 0 from AV.GFP3ori infected muscle was size fractionated by electrophoresis and 
various molecular weight fractions transformed into E. coli. Results demonstrate 
the abundance of circular intermediates at each of the given molecular weights at 
22 and 80 days post-infection with the rAAV shuttle vector. Structure of circular 
intermediates were confirmed by Southern blot restriction analysis. 
1 5 Figure 7. Head-to-tail circular intermediates demonstrate increased 

stability of GFP expression following transient transfection in Hela cells. 
Subconfluent monolayers of Hela cells were co-transfected with p81, p87, or 
pCMVGFP and pRSVlacZ as an internal control for transfection efficiency as 
described in the methods. Panel A demonstrates the expansion of GFP clones 
20 after one passage (arrows). Quantification of clone size and numbers are shown 
in Panel B. Clone size represents the mean raw values while clone numbers are 
normalized for transfection efficiency as determined by X-gal staining for 
pRSVlacZ. The data at the top of bar graph values for each construct in Panel B 
represents quantification of GFP clones after second passage (also normalized 
25 for transfection efficiency). Results indicate the mean (+/-SEM) of duplicate 
experiments with greater than 20 fields quantified for each experimental point. 
The persistence of transfected p81 and pCMVGFP plasmid DNA at passage-7 
post-transfection was evaluated by genomic Southern blot of total cellular DNA 
hybridized against 32 P-labeIed GFP probe (Panel C, results from two independent 
30 transfections are shown). U:uncut, CrPstI cut. The migration of uncut dimer and 
monomer plasmids forms are marked on the left. PstI digestion of the plasmids 
results in bands at 4.7 kb (pCMVGFP, single PstI site in plasmid) and 1.7 kb 
(p8 1 , two PstI sites flanking the GFP gene). To determine whether the 
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head-to-tail ITR array within circular intermediates was responsible for increases 
in the persistence of GFP expression, the head-to-tail ITR DNA element was 
subcloned into the pGL3 luciferase plasmid to generate pGL3(ITR). Results in 
Panel D compare the extent of luciferase transgene expression following 
5 transfection with pGL3 and pGL3(lTR) at 10 days (passage-2) post-transfection. 
Results are the mean (+/-SEM) for triplicate experiments and are normalized for 
transfection efficiency using a dual renilla luciferase reporter vector (pRLSV40, 
Promega). 

Figure 8. Identification of adenoviral genes responsible for augmentation 
10 of AAV circular intermediate formation. Hela cells were infected with 

AV.GFP3ori (1 000 DNA particles/cell) in the presence of wtAdS, rf/802 (E2a- - 
deleted), and dl 1004 (E4-deIeted) adenovirus (at the indicated MOIs). Total 
number of head-to-tail circular intermediates from Hirt DNA and the level of 
augmentation of GFP transgene expression (as determined by FACS) was 

15 quantified at 24 hours post-infection (Panel A). Results are the average of 
duplicate experiments. Panel B depicts results from Southern blot analysis of 
Hirt DNA following hybridization to a GFP P 32 -labeled probe. DNA loads were 
10% of the total Hirt yield from a 35 mm plate of Hela cells. Infections were 
carried out identically to that described for Panel A. Arrows mark replication 

20 form concatamers (Rf c ), dimers (RQ, monomers (RfJ, and single-stranded 
AAV genomes (ssDNA). 

Figure 9. Model for independent mechanistic interactions of adenovirus 
with lytic and latent phase aspects of the AAV life cycle. The adenoviral E4 
gene has been shown to augment the level of rAAV second strand synthesis 

25 giving rise to replication form dimers (Rfj) and monomers (RfJ (Figure 8B). 
This augmentation leads to substantial increases in transgene expression from 
rAAV vectors and most closely mirrors lytic phase replication of wtAAV as 
head-to-head and tail-to-tail concatamers. In contrast, E4 expression inhibits the 
formation of head- to-tail circular intermediates of AAV. Hence, it appears that 

30 increases in the amount of Rf d and Rf m double stranded DNA genomes does not 
increase the extent of circular intermediate formation. Such findings suggest that 
conversion of Rj^ and Rf d to circular intermediates does not likely occur and 
implicates two mechanistically distinct pathway for their formation. In support 
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of this hypothesis, adenoviral E2a gene expression does not enhance the 
formation of and Rf d genomes but rather increase the abundance and/or 
stability of head- to- tail circular intermediates. Furthermore, in the absence of 
E4, E2a gene expression does not lead to augmentation of rAAV transgene 
5 expression. Since circular intermediates have increased episomal stability in 
muscle and in Hela cells, this molecular structure may be important in the latent 
phase of AAV persistence. Alternatively, these circular intermediates may 
represent pre-integration complexes as previously hypothesized for Rep 
facilitated integration. In the absence of Rep, circular intermediates may 
10 accumulate episomally in rAAV infected cells. In summary, these findings 

support the notion that adenovirus may modulate both latent and lytic aspects of 
the AAV life cycle. 

Figure 10. Individual chemical sequence of SphI fragments from p81 (A; 
SEQ ID NO:4), p79 (B; SEQ ID NO:5), and pl202 (C; SEQ ID NO:6) AAV 
1 5 circular intermediates. The ends of the sequence (underlined) represent SphI 
restriction enzyme sites within head-to-tail circular AAV genomes cloned with 
the AV-GFP3ori shuttle virus. 

Figure 1 1. Chemical sequence homology of three AAV circular 
intermediates with various conformations of ITR arrays (SEQ ID NO:4, SEQ ID 
20 NO:5 and SEQ ID NO:6). Diversity in ITR arrays are evident from the non- 
conserved bases marked in lower case. The ends of the sequence (underlined) 
represent SphI restriction enzyme sites within head-to-tail circular AAV 
genomes cloned with the AV.GFP3ori shuttle virus. 

Figure 12 A. Palindromic repeat structure derived from chemical 
25 sequencing of AAV circular intermediate isolate p81. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 

Figure 12B. Palindromic repeat structure derived from chemical 
sequencing of AAV circular intermediate isolate p79. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 
30 Figure 12C. Palindromic repeat structure derived from chemical 

sequencing of AAV circular intermediate isolate p79. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 
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Figure 13. Persistence of GFP expression in developing Xenopus 
embryos microinjected with AAV circular intermediate isolate p81. The extent 
of GFP fluorescence in tadpoles reflects the stability of episomal or integrated 
microinjected plasmids. Bright field image on the left is of the p81 injected 
5 embryo. The p8i injected embryo depicts fluorescence in nearly all cells by one 
week post-injection. In contrast, a mosaic pattern of expression in a minority of 
cells in pCisAV.GFPori injected embryos. The pCisAV.GFPori plasmid 
contains the identical promoter sequences driving GFP gene expression and two 
ITRs separated by stuffer sequence. These findings demonstrate that specific 
10 structural characteristics found within AAV circular intermediates are 
responsible for increased persistence of transgene expression. 

Figure 14. Mechanistic scheme for determining pathways for rAAV 
circular concatamer formation. The two independent vectors used in these 
studies, AV.Alkphos and AV.GFP3.ori, are shown in Panel A . Restriction sites 
1 5 important in the structural analysis of circular intermediates are also shown. In 
Panel B, a schematic representation of two potential models for circular 
concatamer formation is depicted, along with the methods to experimentally 
differentiate which of these processes is active in muscle. Following co- 
infection of the tibialis muscle with AV.Alkphos and AV.GFP3.ori, all 
20 subsequently rescued plasmids arise solely from circular intermediates 
containing AV.GFP3ori genomes. If rolling circular replication is the sole 
mechanism of concatamerization, only GFP expressing plasmids should be 
rescued. In contrast, if intermolecular recombination between independently 
formed monomer circular intermediates is the mechanism of concatamerization, 
25 both GFP and GFP/Alkphos expressing plasmids should be rescued. 

Figure 15. Co-infection of tibialis muscle of mice with AV.Alkphos and 
AV.GFP3ori. Transgene expression of rAAV infected tibialis muscle was 
determined at 14, 35, 80 (Panels A and A'), and 120 (Panels B-D) days 
following co-infection with 5 x 10 9 DNA particles each of AV.Alkphos and 
30 AV.GFP3ori. The time course of transgene expression started around 14 days 
and peaked by 35-80 days. The extent of co-infection of myofibers with both 
Alkphos and GFP rAAV was determined in serial sections of 80 and 120 day 
post-infection muscle samples. Panels A-C represent GFP fluorescence of 
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formalin fixed, cryoprotected sections, while panels A'-C depict the 
histochemical staining for Alkaline phosphatase in adjacent serial sections. A 
short staining time (7 minutes) was necessary to observe variation in staining 
levels for comparison to GFP. It was found that longer staining times 
5 (30 minutes) saturated the Alkphos signal. The boxed region in panels B and B' 
are enlarged in panels C and C, respectively. A more precise correlation of GFP 
and Alkphos staining in myofibers is given in Panel D in which co-localization 
of GFP and Alkphos expression was examined in the same section of a 120 day 
post-infected sample. This was performed by photographing the GFP 

10 fluorescent image prior to staining for Alkphos activity. The left panel of D 
shows a high power Nomarski photomicrograph of a group of myofibers (traced 
in red) , while the corresponding GFP and Alkphos staining patterns are shown 
in the right panel. Photomicrographs of Alkphos staining were taken with a red 
filter to allow for superimposition of staining patterns with GFP fluorescence. 

1 5 Co-expression of Alkphos and GFP is shown within myofibers as a 

yellow/orange color. Myofibers are marked as follows: (-) negative for both 
Alkphos and GFP, (*) positive for only GFP, and (+) positive for both GFP and 
Alkphos. 

Figure 16. Rescue of circular intermediates and characterization of DNA 
20 hybridization patterns. Using the ampicillin resistance gene (amp) and bacterial 
ori incorporated into the AV.GFP3ori vector, the extent of circular intermediate 
formation was assessed by rescuing amp resistant plasmids following 
transformation of 1/5 the isolated Hirt DNA into E. coli Sure cells. Twenty 
plasmids from each muscle sample were prepared and analyzed by slot blot 
25 hybridization against GFP, Alkphos, and Amp 32 P-labeled DNA probes. A 

representative group demonstrating the hybridization patterns is shown in Panel 
A. Panel B depicts the mean (+/-SEM) number of rescued bacterial plasmids 
that hybridized to either GPF alone, or to both GFP and Alkphos probes, 
following transformation of l/5 lh of the Hirt DNA. These numbers were 
30 calculated from the percentage of plasmids hybridizing to GPF and/or Alkphos 
and the total CFU plating efficiency derived from the original transformation. In 
total, 3 independent muscle samples were analyzed for a total of 60 plasmids at 
each time point. The percentage of GFP hybridization positive rescued plasmids 
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that also demonstrated hybridization to Alkphos is shown in Panel C. These data 
demonstrate an increase in the abundance of rescued GFP/Alkphos co-encoding 
circular intermediates over time. 

Figure 17. Transgene expression from rescued circular intermediates. 
5 Rescued circular intermediate plasmids were transfected into 293 cells for 
assessment of their ability to express encoded transgenes. In these studies all 
GFP hybridization positive clones from at least two muscles were tested for each 
time point and scored for their ability to express GFP and Alkaline phosphatase. 
In total at least 40 clones were evaluated for each time point. Three patterns of 
10 transgene expression were observed following transfection of these plasmids: I) 
no gene expression (Panel A), II) GFP expression only (Panel B), and III) GFP 
and Alkphos expression (Panel C). Panels A-C depict Nomareki 
photomicrographs (left) of GFP fluorescent fields (center) and Alkphos staining 
of a different field from the same culture (right). The percentage of GFP 
1 5 hybridization positive clones that also expressed GFP is shown in Panel D. 

Additionally, this panel illustrates the percentage of GFP expressing clones also 
expressing Alkphos. 

Figure 18. Structural analysis of bi-functional concatamer circular 
intermediates. To fully characterize the nature of GFP and Alkphos co- 
20 expressing circular intermediates, detailed structural analyses were performed 
using restriction enzyme mapping and Southern blot hybridization with GFP, 
Alkphos, and ITR 32 P-labeled probes. Results from Southern blot analysis of 
plasmid clone #33 (Panel A) and clone #5 (Panel C) are given as representative 
examples of circular intermediates isolated from 80 and 35 day Hirt DNA of 
25 rAAV infected muscle, respectively. Agarose gels were run in triplicate for each 
of these clones and Southern blot filters were hybridized with one of the three 
DNA probes as indicated below each autoradiogram. Molecular weights (kb) are 
indicated to the left of the ethidium stained agarose gel and restriction enzymes 
are marked on the top of each gel/filter. Panels B and D give the deduced 
30 structure of plasmid clones #33 and #5, respectively, as based on Southern blot 
analysis. For ease of comparison with the restriction maps of the viral genomes 
given in Figure 14A, the position of restriction enzyme sites (kb) are marked 
with the indicated orientation of intact viral genomes. However, in clone #33 a 
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deletion occurred between the Asel and Hindlll site of a head-to-tail array 
between AV.Alkphos and AV.GFP3ori, as reflected by a 900 bp reduction in the 
anticipated size of HindHI/Notl and Clal/Asel fragments (marked by asterisks in 
Panel A). Furthermore, the SphI site flanking an ITR was ablated in clone #5 
5 (bands effected by this deletion are marked by asterisks in Panel C). The 

deletion is not reflected in the overall concatamer since the exact region involved . 
and/or the size of the deletion is unclear. Additionally, chemical sequence 
evidence of rescued circular intermediates suggests that the predominant form of 
ITR arrays may be in a double-D structure (i.e., one ITR flanked by two D- 

10 sequence rather than two ITRs) and hence ITR arrays containing fragments may 
appear 147 bp shorter than indicated. However, to more easily depict the 
orientation of viral genomes, the position of 5 'and 3' ITRs is indicated rather 
than representing a single ITR at these junctions. 

Figure 19. Application of rAAV circular concatamers to deliver trans- 

15 splicing vectors with large gene inserts. Panel A depicts two rAAV vectors 
encoding two halves of a cDNA (red) and flanked by splice site consensus 
sequences (brown). Panel B depicts one potential type of intermolecular 
concatamer following co-infection of cells with the independent vectors shown 
in panel A. Full length transgene mRNA can then be produced by splicing. 

20 Panel C depicts two rAAV vectors encoding two halves of a CFTR DNA flanked 
by a promoter and splice donor or a splice acceptor and a poly A sequence, 
respectively. Panel D shows one potential type of intermolecular concatamer 
following co-infection of cells with the vectors in Panel C. 

Figure 20. Schematic representation of the rAAV vectors used for cis- 

25 activation. 

Figure 21. Strategy for enhancing rAAV gene expression through 
intermolecular c/s-activation. Two independent rAAV viruses, one encoding a 
transgene with or without a minimal promoter (e.g., AV.S V(P)Luc) and another 
harboring enhancer sequences (e.g., AV.SupEnh), were used to co-infect the 
30 same tissue. Subsequent concatamerization between two rAAV vectors 
substantially augments expression of the transgene, due to the presence of 
enhancer elements within the same circularized molecule. 
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Figure 22. Intermolecular m-activation increases rAAV mediated gene 
transfer in fibroblasts. Human fibroblast cells were infected with the indicated 
rAAV vector(s) at an moi of 1000 for each individual vector. Luciferase activity 
was examined at 3 days post-infection. The data represent the mean +/- SEM of 
5 6 independent samples for each experimental condition. 

Figure 23. Intermolecular cis-activation increases rAAV mediated gene 
transfer to muscle in vivo. Mouse tibialis anterior muscles were infected with the 
indicated rAAV vectors) at 2 x 10 10 particles per viral vector in a total volume of 
30 [ll PBS. The luciferase activity in rAAV infected or mock infected (PBS) 

10 muscles was examined at 30 days (Panel A) and 90 days (Panel B) post- 
infection. The data represent the mean +/- SEM of 6 independent muscle 
samples for each experimental condition. Co-administration of the AV.SupEnh 
vector harboring enhancer elements substantially enhanced rAAV mediated 
luciferase expression in muscle from both the ITR and the minimal S V40 

15 promoter. 

Figure 24. Viral constructs for the generation of autonomously 
replicating rAAV vectors as circular concatamers. Panel A depicts two rAAV 
constructs used to test this hypothesis. One encodes the GFP transgene (green) 
and the other encodes the EBNA-1 (red) and OriP (purple) sequences necessary 
20 for autonomous replication. Additionally, sequences encoded within the GFP 
vector allow for rescue of circular intermediates in bacteria. Panel B depicts one 
potential type of intermolecular concatamer following co-infection of the 
independent vectors shown in panel A. 

Figure 25. rAAV vectors used to generate a /ra/w-splicing vector 
25 expressing genomic epo DNA. Panel A shows a schematic of the vectors. An 
IRES sequence and EGFP gene are included in one of the vectors to allow for 
direct visualization of transgene expression. Panel B depicts a potential circular 
concatamer formed after co-infection. The dashed lines indicate the splicing 
pattern. Panel C shows the hnRNA, splicing pattern and mature mRNA 
30 transcripts which result from circular concatamerization of the two vectors. 

Figure 26. Production of full length Epo protein following co-infection 
of primary fibroblasts with two independent tows-splicing vectors. Confluent 
primary fibroblasts (8 x 10 5 cells) were infected with 7 x 10 9 particles of each 
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AV.Epol and/or AV.Epo2. .Epo expression was monitored by ELISA (R&D 
Systems, Minneapolis, MN) by harvesting culture media 24 hours following 
media replacement at the indicated time points. Results are presented as the 
mean Epo level (n = 3) normalized to 1 x 10 s cells. 
5 Figure 27. Functional expression of human Epo in vivo using trans- 

splicing AAV vectors. Hematocrits of the C57BL/6 mice were determined at 10, 
21, 35, 48, 63, 80, and 94 days following infection with 3 x 10 10 particles of 
each independent vector either together in one tibialis muscle (n = 6, denoted as 
solid squares) or independently by administration of AV.Epol to the right 
1 0 tibialis muscle and AV.Epo2 to the left tibialis muscle of the same mouse (n = 4, 
denoted by solid triangles). As a control for baseline, the serum of uninfected 
mice (n = 4, denoted by open circles) was assayed. 

Detailed D esr ription of the invent ion 

15 Definitions 

As used herein, the terms "isolated and/or purified" refer to in vitro 
preparation, isolation and/or purification of a nucleic acid molecule of the 
invention, so that it is not associated with in vivo substances. 

As used herein, a DNA molecule, sequence or segment of the invention 
20 preferably is biologically active. A biologically active DNA molecule of the 
invention has at least about 1%, more preferably at least about 10%, and more 
preferably at least about 50%, of the activity of a DNA molecule comprising ITR 
sequences from a circular intermediate of AAV, e.g., a DNA molecule 
comprising SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a 
25 subunit or variant thereof. The activity of a nucleic acid molecule of the 

invention can be measured by methods well known to the art, some of which are 
described hereinbelow. For example, the presence of the DNA molecule in a 
recombinant nucleic acid molecule in a host cell results in episomal persistence 
and/or increased abundance of the recombinant molecule in those cells relative to 
30 corresponding cells having a recombinant nucleic acid molecule lacking a DNA 
molecule of the invention. 

A variant DNA molecule, sequence or segment of the invention has at 
least about 70%, preferably at least about 80%, and more preferably at least 
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about 90%, but less than 100%, contiguous nucleotide sequence homology or 
identity to a DNA molecule comprising ITR sequences from a circular 
intermediate of AAV, e.g., SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ 
ID NO:6, a subunit thereof A variant DNA molecule of the invention may 
5 include nucleotide bases not present in SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:5, SEQ ID NO:6, e.g., 5', 3' or internal deletions or insertions, such as the 
insertion of a restriction endonuclease recognition site, so long as these bases do 
not substantially reduce the biological activity of the molecule. A substantial 
reduction in activity means a reduction in activity of greater than about 50%, 
10 preferably greater than about 90%. 

I. Identification of Nucleic Acid Molecules Falling Within the Scope of the 

Invention 

A._Nucleic Acid Molecules of the Invention 

1 . Sources of the Nucleic Acid Molecules of the Invention 

1 5 Sources of nucleotide sequences from which the present nucleic acid 

molecules can be obtained include AAV infected cells, e.g., any vertebrate, 
preferably mammalian, cellular source. 

As used herein, the terms "isolated and/or purified" refer to in vitro 
isolation of a nucleic acid, e.g., DNA molecule from its natural cellular 

20 environment, and from association with other components of the cell, such as 
nucleic acid or polypeptide, so that it can be sequenced, replicated, and/or 
expressed. For example, "isolated nucleic acid" is RNA or DNA containing 
greater than about 50, preferably about 300, and more preferably about 500 or 
more, sequential nucleotide bases that comprise a DNA segment from a circular 

25 intermediate of AAV which contains at least a portion of the 5' and 3' ITRs and 
the D sequence, or a variant thereof, that is complementary or hybridizes, 
respectively, to AAV ITR DNA and remains stably bound under stringent 
conditions, as defined by methods well known in the art, e.g., in Sambrook et ah, 
1989. Thus, the RNA or DNA is "isolated" in that it is free from at least one 

30 contaminating nucleic acid with which it is normally associated in the natural 
source of the RNA or DNA and is preferably substantially free of any other 
mammalian RNA or DNA. The phrase "free from at least one contaminating 
source nucleic acid with which it is normally associated" includes the case where 
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the nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 
normally found in the source cell, e.g., in a vector or plasmid. An example of 
isolated nucleic acid within the scope of the invention is nucleic acid that shares 
5 at least about 80%, preferably at least about 90%, and more preferably at least 
about 95%, sequence identity with SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 
or SEQ ID NO:6, or a subunit thereof. 

As used herein, the term "recombinant nucleic acid" or "preselected 
nucleic acid," e.g., "recombinant DNA sequence or segment" or "preselected 
10 DNA sequence or segment" refers to a nucleic acid, e.g., to DNA, that has been 
derived or isolated from any appropriate cellular source, that may be 
subsequently chemically altered in vitro, so that its sequence is not naturally 
occurring, or corresponds to naturally occurring sequences that are not 
positioned as they would be positioned in a genome which has not been 
15 transformed with exogenous DNA. An example of preselected DNA "derived" 
from a source, would be a DNA sequence that is identified as a useful fragment 
within a given organism, and which is then chemically synthesized in essentially 
pure form. An example of such DNA "isolated" from a source would be a useful 
DNA sequence that is excised or removed from said source by chemical means, 
20 e.g., by the use of restriction endonucleases, so that it can be further 

manipulated, e.g., amplified, for use in the invention, by the methodology of 
genetic engineering. 

Thus, recovery or isolation of a given fragment of DNA from a restriction 
digest can employ separation of the digest on polyacrylamide or agarose gel by 
25 electrophoresis, identification of the fragment of interest by comparison of its 
mobility versus that of marker DNA fragments of known molecular weight, 
removal of the gel section containing the desired fragment, and separation of the 
gel from DNA. See Lawn et al., Nucleic Acids % 6103 (1981), and 
Goeddel et al., Nucleic Acids Res., S, 4057 (1980). Therefore, "preselected 
30 DNA" includes completely synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
sequences derived from RNA, as well as mixtures thereof. 
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Nucleic acid molecules having base pair substitutions (i.e., variants) are 
prepared by a variety of methods known in the art. These methods include, but 
are not limited to, isolation from a natural source (in the case of naturally 
occurring sequence variants) or preparation by oligonucleotide-mediated (or site- 
5 directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier 
prepared variant or a non-variant version of the nucleic acid molecule. 

Oligonucleotide-mediated mutagenesis is a preferred method for 
preparing substitution variants. This technique is well known in the art as . 
described by Adelman et al., DNA, 2, 183 (1983). Briefly, AAV DNA is altered 

1 0 by hybridizing an oligonucleotide encoding the desired mutation to a DNA 
template, where the template is the single-stranded form of a plasmid or 
bacteriophage containing the unaltered or native DNA sequence of AAV. After 
hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the 

15 oligonucleotide primer, and will code for the selected alteration in the AAV 
DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. 
An optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 

20 mutation. This ensures that the oligonucleotide will hybridize properly to the 
single-stranded DNA template molecule. The oligonucleotides are readily 
synthesized using techniques known in the art such as that described by Crea et 
al., Pro c . Na tl. A cad. Sci. U.S.A., 25, 5765 (1978). 

The DNA template can be generated by those vectors that are either 

25 derived from bacteriophage Ml 3 vectors (the commercially available Ml 3mpl 8 
and M13mpl9 vectors are suitable), or those vectors that contain a single- 
stranded phage origin of replication as described by Viera et al., Meth. Enzymol. , 
153, 3 (1987). Thus, the DNA that is to be mutated may be inserted into one of 
these vectors to generate single-stranded template. Production of the single- 

30 stranded template is described in Sections 4.21-4.41 of Sambrook et al., 

Molecular Cloning- A Laboratory Manual (Cold Spring Harbor Laboratory 
Press, N.Y. 1989). 
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Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid 
sequence variants, for example), the oligonucleotide is hybridized to the single- 
5 stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase I, is then added to 
synthesize the complementary strand of the template using the oligonucleotide as 
a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of AAV, and the other strand (the 
10 original template) encodes the native, unaltered sequence of AAV. This 

heteroduplex molecule is then transformed into a suitable host cell, usually a 
prokaryote such as E. coli JM101. After the cells are grown, they are plated onto 
agarose plates and screened using the oligonucleotide primer radiolabeled with 
32-phosphate to identify the bacterial colonies that contain the mutated DNA. 
15 The mutated region is then removed and placed in an appropriate vector, 

generally an expression vector of the type typically employed for transformation 
of an appropriate host. 

The method described immediately above may be modified such that a 
homoduplex molecule is created wherein both strands of the plasmid contain the 
20 mutations(s). The modifications are as follows: The single-stranded 

oligonucleotide is annealed to the single-stranded template as described above. 
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 
deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine called dCTP-(ccS) (which can be obtained 
25 from the Amersham Corporation). This mixture is added to the template- 

oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a 
strand of DNA identical to the template except for the mutated bases is 
generated. In addition, this new strand of DNA will contain dCTP-(aS) instead 
of dCTP, which serves to protect it from restriction endonuclease digestion. 
30 After the template strand of the double-stranded heteroduplex is nicked 

with an appropriate restriction enzyme, the template strand can be digested with 
ExoIII nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
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is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
molecule can then be transformed into a suitable host cell such as E. coli JM101. 
5 For example, a preferred embodiment of the invention is an isolated and 

purified DNA molecule comprising a DNA segment comprising SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, a subunit thereof or a variant 
thereof having nucleotide substitutions, or deletions or insertions. 
II. Preparation of Molecules Useful to Practice the Methods of the Invention 

10 A. Nucleic Acid Molecules 

1 Chime ric Expression Cassettes 

To prepare expression cassettes for transformation herein, the 
recombinant or preselected DNA sequence or segment may be circular or linear, 
double-stranded or single-stranded. Generally, the preselected DNA sequence or 

15 segment is in the form of chimeric DNA, such as plasmid DNA, that can also 
contain coding regions flanked by control sequences which promote the 
expression of the preselected DNA present in the resultant cell line. 

As used herein, "chimeric" means that a vector comprises DNA from at 
least two different species, or comprises DNA from the same species, which is 

20 linked or associated in a manner which does not occur in the "native" or wild 
type of the species. 

Aside from the preselected DNA sequences described above, a portion of 
the preselected DNA may serve a regulatory or a structural function. For 
example, the preselected DNA may itself comprise a promoter that is active in 

25 mammalian cells, or may utilize a promoter already present in the genome that is 
the transformation target. Such promoters include the CMV promoter, as well as 
the SV40 late promoter and retroviral LTRs (long terminal repeat elements), 
although many other promoter elements well known to the art may be employed 
in the practice of the invention. 

30 Other elements functional in the host cells, such as introns, enhancers, 

polyadenylation sequences and the like, may also be a part of the preselected 
DNA. Such elements may or may not be necessary for the function of the DNA, 
but may provide improved expression of the DNA by affecting transcription, 
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stability of the mRNA, or the like. Such elements may be included in the DNA 
as desired to obtain the optimal performance of the transforming DNA in the 
cell. 

"Control sequences" is defined to mean DNA sequences necessary for 
5 the expression of an operably linked coding sequence in a particular host 
organism. The control sequences that are suitable for prokaryotic cells, for 
example, include a promoter, and optionally an operator sequence, and a 
ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 
1 0 "Operably linked" is defined to mean that the nucleic acids are placed in 

a functional relationship with another nucleic acid sequence. For example, DNA 
for a presequence or secretory leader is operably linked to DNA for a peptide or 
polypeptide if it is expressed as a preprotein that participates in the secretion of 
the peptide or polypeptide; a promoter or enhancer is operably linked to a coding 
1 5 sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being 
linked are contiguous and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
20 accomplished by ligation at convenient restriction sites. If such sites do not 
exist, the synthetic oligonucleotide adaptors or linkers are used in accord with 
conventional practice. 

The preselected DNA to be introduced into the cells further will generally 
contain either a selectable marker gene or a reporter gene or both to facilitate 
25 identification and selection of transformed cells from the population of cells 

sought to be transformed. Alternatively, the selectable marker may be carried on 
a separate piece of DNA and used in a co-transfonnation procedure. Both 
selectable markers and reporter genes may be flanked with appropriate 
regulatory sequences to enable expression in the host cells. Useful selectable 
30 markers are well known in the art and include, for example, antibiotic and 

herbicide-resistance genes, such as neo t hpt, dhfr, bar, aroA, dapA and the like. 
See also, the genes listed on Table 1 of Lundquist et al. (U.S. Patent No. 
5,848,956). 
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Reporter genes are used for identifying potentially transformed cells and 
for evaluating the functionality of regulatory sequences. Reporter genes which 
encode for easily assayable proteins are well known in the art. In general, a 
reporter gene is a gene which is not present in or expressed by the recipient 
5 organism or tissue and which encodes a protein whose expression is manifested 
by some easily detectable property, e.g., enzymatic activity. Preferred genes 
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of E. coli, the 
beta-glucuronidase gene (gus) of the uidA locus of E. coli, and the luciferase 
gene from firefly Photinus pyralis. Expression of the reporter gene is assayed at 
10 a suitable time after the DNA has been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 
compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular Cloning- A 
15 Laboratory Manual, Cold Spring Harbor Laboratory Press (2d ed., 1989), 
provides suitable methods of construction. 
2. Transformation into Host OJk 

The recombinant DNA can be readily introduced into the host cells, e.g., 
mammalian, bacterial, yeast or insect cells by transfection with an expression 
20 vector of the invention, by any procedure useful for the introduction into a 
particular cell, e.g., physical or biological methods, to yield a transformed cell 
having the recombinant DNA stably integrated into its genome or present as an 
episome which can persist in the transformed cells, so that the DNA molecules, 
sequences, or segments, of the present invention are maintained and/or expressed 
25 by the host cell. 

Physical methods to introduce a preselected DNA into a host cell include 
calcium phosphate precipitation, lipofection, particle bombardment, 
microinjection, electroporation, and the like. Biological methods to introduce 
the DNA of interest into a host cell include the use of DNA and RNA viral 
30 vectors. The main advantage of physical methods is that they are not associated 
with pathological or oncogenic processes of viruses. However, they are less 
precise, often resulting in multiple copy insertions, random integration, 
disruption of foreign and endogenous gene sequences, and unpredictable 
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expression. 

As used herein, the term "cell line" or "host cell" is intended to refer to 
well-characterized homogenous, biologically pure populations of cells. These 
cells may be eukaryotic cells that are neoplastic or which have been 
5 "immortalized" in vitro by methods known in the art, as well as primary cells, or 
prokaryotic cells. The cell line or host cell is preferably of mammalian origin, 
but cell lines or host cells of non-mammalian origin may be employed, including 
plant, insect, yeast, fungal or bacterial sources. Generally, the preselected DNA 
sequence is related to a DNA sequence which is resident in the genome of the 
10 host cell but is not expressed, or not highly expressed, or, alternatively, 
overexpressed. 

"Transfected" or "transformed" is used herein to include any host cell or 
cell line, the genome of which has been altered or augmented by the presence of 
at least one preselected DNA sequence, which DNA is also referred to in the art 

1 5 of genetic engineering as "heterologous DNA," "recombinant DNA," 

"exogenous DNA," "genetically engineered," "non-native," or "foreign DNA," 
wherein said DNA was isolated and introduced into the genome of the host cell 
or cell line by the process of genetic engineering. The host cells of the present 
invention are typically produced by transfection with a DNA sequence in a 

20 plasmid expression vector, a viral expression vector, or as an isolated linear 
DNA sequence. 

To confirm the presence of the preselected DNA sequence in the host 
cell, a variety of assays may be performed. Such assays include, for example, 
"molecular biological" assays well known to those of skill in the art, such as 

25 Southern and Northern blotting, RT-PCR and PCR; "biochemical" assays, such 
as detecting the presence of a polypeptide expressed from a gene present in the 
vector, e.g., by immunological means (immunoprecipitations, immunoaffinity 
columns, ELISAs and Western blots) or by any other assay useful to identify 
molecules falling within the scope of the invention. 

30 To detect and quantitate RNA produced from introduced DNA segments, 

RT-PCR may be employed. In this application of PCR, it is first necessary to 
reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, 
and then through the use of conventional PCR techniques amplify the DNA. In 
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most instances PCR techniques, while useful, will not demonstrate integrity of 
the RNA product. Further information about the nature of the RNA product may 
be obtained by Northern blotting. This technique demonstrates the presence of 
an RNA species and gives information about the integrity of that RNA. The 
5 presence or absence of an RNA species can also be determined using dot or slot 
blot Northern hybridizations. These techniques are modifications of Northern 
blotting and only demonstrate the presence or absence of an RNA species. 

While Southern blotting and PCR may be used to detect the DNA 
segment in question, they do not provide information as to whether the DNA 
1 0 segment is being expressed. Expression may be evaluated by specifically 
identifying the polypeptide products of the introduced DNA sequences or 
evaluating the phenotypic changes brought about by the expression of the 
introduced DNA segment in the host cell. 
IIT . Dosapes , Formul a tions and R outes r>f AHm inistratmn 
1 5 Adrninistration of a nucleic acid molecule may be accomplished through 

the introduction of cells transformed with the nucleic acid molecule (see, for 
example, WO 93/02556), the administration of the nucleic acid molecule itself 
(see, for example, Feigner et al., U.S. Patent No. 5,580,859, Pardoll et al., 
Immunity , 1, 165 (1995); Stevenson et al., Immunol R^y ^ 211 (1995); 
20 Moiling, J . Mol . Mcd , 71, 242 (1997); Donnelly et al.. Ann N V AraH R r i 
222, 40 (1995); Yang et al., Mol. Mfid Today, 2, 476 (1996); Abdallah et al., 
B i ()1 - rRl1 ' 1 (1995)), through infection with a recombinant virus or via 
liposomes. Pharmaceutical formulations, dosages and routes of administration 
for nucleic acids are generally disclosed, for example, in Feigner et al., supra. 
25 Administration of the therapeutic agents in accordance with the present 

invention may be continuous or intermittent, depending, for example, upon the 
recipient's physiological condition, whether the purpose of the administration is 
therapeutic or prophylactic, and other factors known to skilled practitioners. The 
administration of the agents of the invention may be essentially continuous over 
30 a preselected period of time or may be in a series of spaced doses. Both local 
and systemic administration is contemplated. When the molecules of the 
invention are employed for prophylactic purposes, agents of the invention are 
amenable to chronic use, preferably by systemic administration. 
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One or more suitable unit dosage forms comprising the therapeutic agents 
of the invention, which, as discussed below, may optionally be formulated for 
sustained release, can be administered by a variety of routes including oral, or 
parenteral, including by rectal, transdermal, subcutaneous, intravenous, 
5 intramuscular, intraperitoneal, intrathoracic, intrapulmonary and intranasal 
routes. The formulations may, where appropriate, be conveniently presented in 
discrete unit dosage forms and may be prepared by any of the methods well 
known to pharmacy. Such methods may include the step of bringing into 
association the therapeutic agent with liquid carriers, solid matrices, semi-solid 
10 carriers, finely divided solid carriers or combinations thereof, and then, if 

necessary, introducing or shaping the product into the desired delivery system. 

When the therapeutic agents of the invention are prepared for oral 
administration, they are preferably combined with a pharmaceutical ly acceptable 
carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage 
15 form. The total active ingredients in such formulations comprise from 0.1 to 
99.9% by weight of the formulation. By "pharmaceutically acceptable" it is 
meant the carrier, diluent, excipient, and/or salt must be compatible with the 
other ingredients of the formulation, and not deleterious to the recipient thereof. 
The active ingredient for oral administration may be present as a powder or as 
20 granules; as a solution, a suspension or an emulsion; or in achievable base such 
as a synthetic resin for ingestion of the active ingredients from a chewing gum. 
The active ingredient may also be presented as a bolus, electuary or paste. 

Pharmaceutical formulations containing the therapeutic agents of the 
invention can be prepared by procedures known in the art using well known and 
25 readily available ingredients. For example, the agent can be formulated with 
common excipients, diluents, or carriers, and formed into tablets, capsules, 
suspensions, powders, and the like. Examples of excipients, diluents, and 
carriers that are suitable for such formulations include the following fillers and 
extenders such as starch, sugars, mannitol, and silicic derivatives; binding agents 
30 such as carboxymethyl cellulose, HPMC and other cellulose derivatives, 
alginates, gelatin, and polyvinylpyrrolidone; moisturizing agents such as 
glycerol; disintegrating agents such as calcium carbonate and sodium 
bicarbonate; agents for retarding dissolution such as paraffin; resorption 
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accelerators such as quaternary ammonium compounds; surface active agents 
such as cetyl alcohol, glycerol monostearate; adsorptive carriers such as kaolin 
and bentonite; and lubricants such as talc, calcium and magnesium stearate, and 
solid polyethyl glycols. 
5 For example, tablets or caplets containing the agents of the invention can 

include buffering agents such as calcium carbonate, magnesium oxide and 
magnesium carbonate. Caplets and tablets can also include inactive ingredients 
such as cellulose, pregelatinized starch, silicon dioxide, hydroxy propyl methyl 
cellulose, magnesium stearate, microcrystalline cellulose, starch, talc, titanium 
10 dioxide, benzoic acid, citric acid, com starch, mineral oil, polypropylene glycol, 
sodium phosphate, and zinc stearate, and the like. Hard or soft gelatin capsules 
containing an agent of the invention can contain inactive ingredients such as 
gelatin, microcrystalline cellulose, sodium lauryl sulfate, starch, talc, and 
titanium dioxide, and the like, as well as liquid vehicles such as polyethylene 
1 5 glycols (PEGs) and vegetable oil. Moreover, enteric coated caplets or tablets of 
an agent of the invention are designed to resist disintegration in the stomach and 
dissolve in the more neutral to alkaline environment of the duodenum. 

The therapeutic agents of the invention can also be formulated as elixirs 
or solutions for convenient oral administration or as solutions appropriate for 
20 parenteral administration, for instance by intramuscular, subcutaneous or 
intravenous routes. 

The pharmaceutical formulations of the therapeutic agents of the 
invention can also take the form of an aqueous or anhydrous solution or 
dispersion, or alternatively the form of an emulsion or suspension. 
25 Thus, the therapeutic agent may be formulated for parenteral 

administration (e.g., by injection, for example, bolus injection or continuous 
infusion) and may be presented in unit dose form in ampules, pre-filled syringes, 
small volume infusion containers or in multi-dose containers with an added 
preservative. The active ingredients may take such forms as suspensions, 
30 solutions, or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active ingredients may be in powder form, obtained by aseptic isolation of 
sterile solid or by lyophilization from solution, for constitution with a suitable 
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vehicle, e.g., sterile, pyrogen-free water, before use. 

These formulations can contain pharmaceutical^ acceptable vehicles and 
adjuvants which are well known in the prior art. It is possible, for example, to 
prepare solutions using one or more organic solvent(s) that is/are acceptable 
5 from the physiological standpoint, chosen, in addition to water, from solvents 
such as acetone, ethanol, isopropyl alcohol, glycol ethers such as the products 
sold under the name "Dowanol", polyglycols and polyethylene glycols, C r C 4 
alkyl esters of short-chain acids, preferably ethyl or isopropyl lactate, fatty acid 
triglycerides such as the products marketed under the name "Miglyol", isopropyl 
10 myristate, animal, mineral and vegetable oils and polysiloxanes. 

The compositions according to the invention can also contain thickening 
agents such as cellulose and/or cellulose derivatives. They can also contain 
gums such as xanthan, guar or carbo gum or gum arabic, or alternatively 
polyethylene glycols, bentones and montmorillonites, and the like. 
15 It is possible to add, if necessary, an adjuvant chosen from antioxidants, 

surfactants, other preservatives, film-forming, keratolytic or comedolytic agents, 
perfumes and colorings. Also, other active ingredients may be added, whether 
for the conditions described or some other condition. 

For example, among antioxidants, t-butylhydroquinone, butylated 
20 hydroxyanisole, butylated hydroxytoluene and a-tocopherol and its derivatives 
may be mentioned. The galenical forms chiefly conditioned for topical 
application take the form of creams, milks, gels, dispersion or microemulsions, 
lotions thickened to a greater or lesser extent, impregnated pads, ointments or 
sticks, or alternatively the form of aerosol formulations in spray or foam form or 
25 alternatively in the form of a cake of soap. 

Additionally, the agents are well suited to formulation as sustained 
release dosage forms and the like. The formulations can be so constituted that 
they release the active ingredient only or preferably in a particular part of the 
intestinal or respiratory tract, possibly over a period of time. The coatings, 
30 envelopes, and protective matrices may be made, for example, from polymeric 
substances, such as polylactide-glycolates, liposomes, microemulsions, 
microparticles, nanoparticles, or waxes. These coatings, envelopes, and 
protective matrices are useful to coat indwelling devices, e.g., stents, catheters, 
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peritoneal dialysis tubing, and the like. 

The therapeutic agents of the invention can be delivered via patches for 
transdermal administration. See U.S. Patent No. 5,560,922 for examples of 
patches suitable for transdermal delivery of a therapeutic agent. Patches for 
5 transdermal delivery can comprise a backing layer and a polymer matrix which 
has dispersed or dissolved therein a therapeutic agent, along with one or more 
skin permeation enhancers. The backing layer can be made of any suitable mate- 
rial which is impermeable to the therapeutic agent. The backing layer serves as a 
protective cover for the matrix layer and provides also a support function. The 
10 backing can be formed so that it is essentially the same size layer as the polymer 
matrix or it can be of larger dimension so that it can extend beyond the side of 
the polymer matrix or overlay the side or sides of the polymer matrix and then 
can extend outwardly in a manner that the surface of the extension of the backing 
layer can be the base for an adhesive means. Alternatively, the polymer matrix 
15 can contain, or be formulated of, an adhesive polymer, such as polyacrylate or 
acrylate/vinyl acetate copolymer. For long-term applications it might be desir- 
able to use microporous and/or breathable backing laminates, so hydration or 
maceration of the skin can be minimized. 

Examples of materials suitable for making the backing layer are films of 
20 high and low density polyethylene, polypropylene, polyurethane, 

polyvinylchloride, polyesters such as polyethylene phthalate), metal foils, metal 
foil laminates of such suitable polymer films, and the like. Preferably, the 
materials used for the backing layer are laminates of such polymer films with a 
metal foil such as aluminum foil. In such laminates, a polymer film of the 
25 laminate will usually be in contact with the adhesive polymer matrix. 

The backing layer can be any appropriate thickness which will provide 
the desired protective and support functions. A suitable thickness will be from 
about 10 to about 200 microns. 

Generally, those polymers used to form the biologically acceptable 
30 adhesive polymer layer are those capable of forming shaped bodies, thin walls or 
coatings through which therapeutic agents can pass at a controlled rate. Suitable 
polymers are biologically and pharmaceutically compatible, nonallergenic and 
insoluble in and compatible with body fluids or tissues with which the device is 
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contacted. The use of soluble polymers is to be avoided since dissolution or 
erosion of the matrix by skin moisture would affect the release rate of the 
therapeutic agents as well as the capability of the dosage unit to remain in place 
for convenience of removal. 
5 Exemplary materials for fabricating the adhesive polymer layer include 

polyethylene, polypropylene, polyurethane, ethylene/propylene copolymers, 
ethylene/ethylacrylate copolymers, ethylene/vinyl acetate copolymers, silicone 
elastomers, especially the medical-grade polydimethylsiloxanes, neoprene 
rubber, polyisobutylene, polyacrylates, chlorinated polyethylene, polyvinyl 

10 chloride, vinyl chloride- vinyl acetate copolymer, crosslinked polymethacrylate 
polymers (hydrogel), polyvinylidene chloride, poly(ethylene terephthalate), butyl 
rubber, epichlorohydrin rubbers, ethylenvinyl alcohol copolymers, ethylene- 
vinyloxyethanol copolymers; silicone copolymers, for example, polysiloxane- 
polycarbonate copolymers, polysiloxanepolyethylene oxide copolymers, 

15 polysiloxane-polymethacrylate copolymers, polysiloxane-alkylene copolymers 
(e.g., polysiloxane-ethylene copolymers), polysiloxane-alkylenesilane 
copolymers (e.g., polysiloxane-ethylenesilane copolymers), and the like; 
cellulose polymers, for example methyl or ethyl cellulose, hydroxy propyl 
methyl cellulose, and cellulose esters; polycarbonates; polytetrafluoroethylene; 

20 and the like. 

Preferably, a biologically acceptable adhesive polymer matrix should be 
selected from polymers with glass transition temperatures below room 
temperature. The polymer may, but need not necessarily, have a degree of 
crystallinity at room temperature. Cross-linking monomelic units or sites can be 

25 incorporated into such polymers. For example, cross-linking monomers can be 
incorporated into polyacrylate polymers, which provide sites for cross-linking 
the matrix after dispersing the therapeutic agent into the polymer. Known cross- 
linking monomers for polyacrylate polymers include polymethacrylic esters of 
polyols such as butylene diacrylate and dimethacrylate, trimethylol propane 

30 trimethacrylate and the like. Other monomers which provide such sites include 
allyl acrylate, allyl methacrylate, diallyl maleate and the like. 

Preferably, a plasticizer and/or humectant is dispersed within the 
adhesive polymer matrix. Water-soluble polyols are generally suitable for this 
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purpose. Incorporation of a humectant in the formulation allows the dosage unit 
to absorb moisture on the surface of skin which in turn helps to reduce skin 
irritation and to prevent the adhesive polymer layer of the delivery system from 
failing. 

5 Therapeutic agents released from a transdermal delivery system must be 

capable of penetrating each layer of skin. In order to increase the rate of 
permeation of a therapeutic agent, a transdermal drug delivery system must be 
able in particular to increase the permeability of the outermost layer of skin, the 
stratum coraeum, which provides the most resistance to the penetration of 
10 molecules. The fabrication of patches for transdermal delivery of therapeutic 
agents is well known to the art. 

For administration to the upper (nasal) or lower respiratory tract by 
inhalation, the therapeutic agents of the invention are conveniently delivered 
from an insufflator, nebulizer or a pressurized pack or other convenient means of 
15 delivering an aerosol spray. Pressurized packs may comprise a suitable 
propellant such as dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol, the dosage unit may be determined by providing a valve to 
deliver a metered amount. 
20 Alternatively, for administration by inhalation or insufflation, the 

composition may take the form of a dry powder, for example, a powder mix of 
the therapeutic agent and a suitable powder base such as lactose or starch. The 
powder composition may be presented in unit dosage form in, for example, 
capsules or cartridges, or, e.g., gelatine or blister packs from which the powder 
25 may be administered with the aid of an inhalator, insufflator or a metered-dose 
inhaler. 

For intra-nasal administration, the therapeutic agent may be administered 
via nose drops, a liquid spray, such as via a plastic bottle atomizer or metered- 
dose inhaler. Typical of atomizers are the Mistometer (Wintrop) and the 
30 Medihaler (Riker). 

The local delivery of the therapeutic agents of the invention can also be 
by a variety of techniques which administer the agent at or near the site of 
disease. Examples of site-specific or targeted local delivery techniques are not 
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intended to be limiting but to be illustrative of the techniques available. 
Examples include local delivery catheters, such as an infusion or indwelling 
catheter, e.g., a needle infusion catheter, shunts and stents or other implantable 
devices, site specific carriers, direct injection, or direct applications. 
5 For topical administration, the therapeutic agents may be formulated as is 

known in the art for direct application to a target area. Conventional forms for 
this purpose include wound dressings, coated bandages or other polymer 
coverings, ointments, creams, lotions, pastes, jellies, sprays, and aerosols. 
Ointments and creams may, for example, be formulated with an aqueous or oily 
10 base with the addition of suitable thickening and/or gelling agents. Lotions may 
be formulated with an aqueous or oily base and will in general also contain one 
or more emulsifying agents, stabilizing agents, dispersing agents, suspending 
agents, thickening agents, or coloring agents. The active ingredients can also be 
delivered via iontophoresis, e.g., as disclosed in U.S. Patent Nos. 4,140,122; 
15 4,383,529; or 4,05 1,842. The percent by weight of a therapeutic agent of the 
invention present in a topical formulation will depend on various factors, but 
generally will be from 0.01% to 95% of the total weight of the formulation, and 
typically 0.1-25% by weight. 

Drops, such as eye drops or nose drops, may be formulated with an 
20 aqueous or non-aqueous base also comprising one or more dispersing agents, 
solubilizing agents or suspending agents. Liquid sprays are conveniently 
delivered from pressurized packs. Drops can be delivered via a simple eye 
dropper-capped bottle, or via a plastic bottle adapted to deliver liquid contents 
dropwise, via a specially shaped closure. 
25 The therapeutic agent may further be formulated for topical 

administration in the mouth or throat. For example, the active ingredients may 
be formulated as a lozenge further comprising a flavored base, usually sucrose 
and acacia or tragacanth; pastilles comprising the composition in an inert base 
such as gelatin and glycerin or sucrose and acacia; and mouthwashes comprising 
30 the composition of the present invention in a suitable liquid carrier. 

The formulations and compositions described herein may also contain 
other ingredients such as antimicrobial agents, or preservatives. Furthermore, 
the active ingredients may also be used in combination with other therapeutic 
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agents, for example, bronchodilators. 

In particular, for delivery of a vector of the invention to a tissue such as 
muscle, any physical or biological method that will introduce the vector into the 
muscle tissue of a host animal can be employed. Vector means both a bare 
5 recombinant vector and vector DNA packaged into viral coat proteins, as is well 
known for AAV administration. Simply dissolving an AAV vector in phosphate 
buffered saline has been demonstrated to be sufficient to provide a vehicle useful 
for muscle tissue expression, and there are no known restrictions on the carriers 
or other components that can be coadministered with the vector (although 
10 compositions that degrade DNA should be avoided in the normal manner with 
vectors). Pharmaceutical compositions can be prepared as injectable 
formulations or as topical formulations to be delivered to the muscles by 
transdermal transport. Numerous formulations for both intramuscular injection 
and transdermal transport have been previously developed and can be used in the 
1 5 practice of the invention. The vectors can be used with any pharmaceutical^ 
acceptable carrier for ease of administration and handling. 

For purposes of intramuscular injection, solutions in an adjuvant such as 
sesame or peanut oil or in aqueous propylene glycol can be employed, as well as 
sterile aqueous solutions. Such aqueous solutions can be buffered, if desired, 
20 and the liquid diluent first rendered isotonic with saline or glucose. Solutions of 
the AAV vector as a free acid (DNA contains acidic phosphate groups) or a 
pharmacologically acceptable salt can be prepared in water suitably mixed with a 
surfactant such as hydroxypropylcellulose. A dispersion of AAV viral particles 
can also be prepared in glycerol, liquid polyethylene glycols and mixtures 
25 thereof and in oils. Under ordinary conditions of storage and use, these 

preparations contain a preservative to prevent the growth of microorganisms. In 
this connection, the sterile aqueous media employed are all readily obtainable by 
standard techniques well-known to those skilled in the art. 

The pharmaceutical forms suitable for injectable use include sterile 
30 aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions. In all cases the form 
must be sterile and must be fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and storage and must be 
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preserved against the contaminating action of microorganisms such as bacteria 
and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, liquid 
polyethylene glycol and the like), suitable mixtures thereof, and vegetable oils. 
5 The proper fluidity can be maintained, for example, by the use of a coating such 
as lecithin, by the maintenance of the required particle size in the case of a 
dispersion and by the use of surfactants. The prevention of the action of 
microorganisms can be brought about by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal and 
10 the like. In many cases it will be preferable to include isotonic agents, for 
example, sugars or sodium chloride. Prolonged absorption of the injectable 
compositions can be brought about by use of agents delaying absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the AAV vector 
1 5 in the required amount in the appropriate solvent with various of the other 
ingredients enumerated above, as required, followed by filtered sterilization. 
Generally, dispersions are prepared by incorporating the sterilized active 
ingredient into a sterile vehicle which contains the basic dispersion medium and 
the required other ingredients from those enumerated above. In the case of 
20 sterile powders for the preparation of sterile injectable solutions, the preferred 
methods of preparation are vacuum drying and the freeze drying technique which 
yield a powder of the active ingredient plus any additional desired ingredient 
from the previously sterile-filtered solution thereof. 

For purposes of topical administration, dilute sterile, aqueous solutions 
25 (usually in about 0. 1% to 5% concentration), otherwise similar to the above 
parenteral solutions, are prepared in containers suitable for incorporation into a 
transdermal patch, and can include known carriers, such as pharmaceutical grade 
dimethylsulfoxide (DMSO). 

The therapeutic compounds of this invention may be administered to a 
30 mammal alone or in combination with pharmaceutically acceptable earners. As 
noted above, the relative proportions of active ingredient and carrier are 
determined by the solubility and chemical nature of the compound, chosen route 
of administration and standard pharmaceutical practice. 
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The dosage of the present therapeutic agents which will be most suitable 
for prophylaxis or treatment will vary with the form of administration, the 
particular compound chosen and the physiological characteristics of the 
particular patient under treatment. Generally, small dosages will be used 
5 initially and, if necessary, will be increased by small increments until the 

optimum effect under the circumstances is reached. Exemplary dosages are set 
out in the example below. 

Since AAV has been shown to have a broad host range (for pulmonary 
expression) and persists in muscle, the vectors of the invention may be employed 
1 0 to express a gene in any animal, and particularly in mammals, birds, fish, and 
reptiles, especially domesticated mammals and birds such as cattle, sheep, pigs, 
horses, dogs, cats, chickens, and turkeys. Both human and veterinary uses are 
particularly preferred. 

The gene being expressed can be either a DNA segment encoding a 
15 protein, with whatever control elements (e.g., promoters, operators) are desired 
by the user, or a non-coding DNA segment, the transcription of which produces 
all or part of some RNA-containing molecule (such as a transcription control 
element, +RNA, or anti-sense molecule). 

Muscle tissue is a very attractive target for in vivo gene delivery and gene 
20 therapy, because it is not a vital organ and is very easy to access. If a disease is 
caused by a defective gene product which is required to be produced and/or 
secreted, such as hemophilia, diabetes and Gaucher's disease, and the like, is 
muscle is a good candidate to supply the gene product if the appropriate gene can 
be effectively delivered into the cells. 
25 Different vectors, such as naked DNA, adenovirus and retrovirus, have 

been utilized to directly deliver various transgenes into muscle tissues. 
However, neither system can offer both high efficiency and long-term 
expression. For naked plasmid DNA directly delivered into muscle tissue, the 
efficiency is not high. There are only a few cells near the injection site that can 
30 maintain transgene expression. Furthermore, the plasmid DNA in the cells 

remains as non-replicating episomes, i.e., in the unintegrated form. Therefore, it 
will be eventually lost. For adenovirus vector, it can infect the non-dividing 
cells, and therefore, can be directly delivered into the mature tissues such as 
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muscle. However, the transgene delivered by adenovirus vectors are not useful 
to maintain long-term expression for the following reasons. First, since 
adenovirus vectors still retain most of the viral genes, they are not very safe. 
Moreover, the expression of those genes can cause the immune system to destroy 
5 the cells containing the vectors (see, for example, Yang et ah 1994, Proc. Natl. 
Acad. Sci. 91:4407-441 1). Second, since adenovirus is not an integration virus, 
its DNA will eventually be diluted or degraded in the cells. Third, due to the 
immune response, adenovirus vector could not be repeatedly delivered. In the 
case of lifetime diseases, this will be a major limitation. For retrovirus vectors, 
1 0 although they can achieve stable integration into the host chromosomes, their use 
is very restricted because they can only infect dividing cells while a large 
majority of the muscle cells are non-dividing. 

Adeno-associated virus vectors have certain advantages over the above- 
mentioned vector systems. First, like adenovirus, AAV can efficiently infect 
1 5 non-dividing cells. Second, all the AAV viral genes are eliminated in the vector. 
Since the viral-gene-expression-induced immune reaction is no longer a concern, 
AAV vectors are safer than Ad vectors. Thirds, AAV is an integration virus by 
nature, and integration into the host chromosome will stably maintain its 
transgene in the cells. Fourth, AAV is an extremely stable virus, which is 
20 resistant to many detergents, pH changes and heat (stable at 56°C for more than 
an hour). It can be lyophilized and redissolved without losing its activity. 
Therefore, it is a very promising delivery vehicle for gene therapy. 

The invention will be further described by, but is not limited to, the 
following examples. 

25 

Example I 

Materials and Methods 
Construction of rAAV Shuttle Vpr.tn r 

A recombinant AAV shuttle vector (AV.GFP3ori) which contained a 
30 GFP transgene cassette, bacterial ampicillin resistance gene, and bacterial origin 
of replication, was generated from a cw-acting plasmid (pCisAV.GFP3ori). 
Expression of the GFP gene was directed by the CMV promoter/enhancer and 
S V40 poly-adenylation sequences. pCisAV.GFP3ori was constructed with 
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pSub201 derived ITR elements (Samulski et al., 1987) and the intactness of ITR 
sequences was confirmed by restriction analysis with Smal and PvuII, and by 
sequencing. Recombinant AAV stocks were generated by co-transfection of 
pCisAV.GFP3ori and pRep/Cap together with co-infection of recombinant 
5 Ad.CMVlacZ in 293 cells (Duan et al., 1 997). Following transfection of forty 
1 50 mm plates, cells were collected at 72 hours by centrifiigation and 
resuspended in 12 ml of buffer (10 mM Tris pH 8.0). Virus was released from 
cells by three cycles of freeze/thawing and passaged through a 25 gauge needle 
six times. Cell lysates were then treated with 1.3 mg/ml DNase I at 37°C for 30 
10 minutes and 1% deoxycholate (g/ml final) and 0.05% trypsin (g/ml final) at 
37 °C for 30 minutes. Samples were then placed on ice for 10 minutes and 
centrifuged to remove large particulate material at 3,000 rpm for 30 minutes. 

rAAV was purified by isopycnic density gradient centrifiigation in CsCl 
(r=1.4) in a SW55 rotor for 72 hours at 35K. Peak fractions of AAV were 
15 combined and re-purified through two more rounds of CsCl centrifiigation, 

followed by heating at 58 °C for 60 minutes to inactivate all contaminant helper 
adenovirus. Typically, this preparation gave approximate AAV titers of 10 12 
DNA molecules/ml and 2.5 x 10 8 GFP-expressing units/ml. Recombinant viral 
titers were assessed by slot blot and quantified against pCisAV.GFP3ori controls 
20 for DNA particles. Functional transducing units were quantified by GFP 
transgene expression in 293 cells. The absence of helper adenovirus was 
confirmed by histochemical staining of rAAV infected 293 cells for 
beta-galactosidase, and no recombinant adenovirus was found in 10 10 particles of 
purified rAAV stocks. The absence of significant wtAAV contamination was 
25 confirmed by immunocytochemical staining of rAAV/Ad co-infected 293 cells 
with anti-Rep antibodies. These studies, which had a sensitivity of 1 wtAAV in 
10 10 rAAV particles, demonstrated an absence of Rep staining as compared to 
pRep/Cap plasmid transfected controls. 

Isolation and Structural Evaluation of AAV Circular Intermediates From Hela 
30 Cells. 

Hela cells were grown in 35 mm dishes in DMEM media supplemented 
with 10% fetal calf serum (FCS). Cells were infected in the presence of 2% FCS 
at 80% confluency with recombinant AV.GFP3ori (MOI^IOOO particles/cell, 1 x 
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10 9 total particles/plate) and Hirt DNAs isolated as described by Duan et al. 
(1997) at 6, 12, 24, 48, and 72 hours post-infection. In experiments analyzing 
the effects of adenovirus, plates were co-infected with Ad.CMVLacZ 
(MOI=5000 particles/cell) in the presence of 2%FCS/DMEM. Zero hour 
5 controls were generated by mixing 1 0 9 particles of AV.GFP3ori with cell lysates 
prior to Hirt DNA preparation. Hirt DNA isolated at each time point was used to 
transform E. coli SURE cells (Stratagene, La Jolla, CA.). Typically, 1/10 of the 
Hirt DNA preparation was used to transform 40 ml of competent bacteria by 
electroporation. The resultant total number of bacterial colonies was quantified 
10 for each time point and the structure of circular intermediates was evaluated for 
greater than 20 plasmid clones for each time point from two independent 
experiments. Structural determinations were based on restriction enzyme 
analysis using PstI, SphI, Asel single and double digests together with Southern 
blotting against GFP, stuffer, and ITR probes. 
15 Evaluation of E2a and GFP gene expression in Hela cells 

E2a gene expression was evaluated by immunofluorescent staining of 
Hela cells superinfected with El -deleted Ad.CMVlacZ (MOI= 0, 500, 5000 
particles/cell). Briefly, cells were fixed in methanol at -20°C for 10 minutes 
followed by air drying. Cells were then incubated at room temperature with 
20 hybridoma supernatant against Ad5 72kd DBP (Reich et al, 1983), followed by 
goat anti-mouse-FITC antibody (5 mg/ml) for 30 minutes at room temperature. 
In studies evaluating augmentation of AAV GFP transgene expression by 
adenovirus, Hela cells were harvested at 24 or 72 hours post-infection by 
trypsinization, resuspended in 2%FCS/PBS and evaluated by FACS analyses. 
25 Thresholds were set using uninfected controls and the percentage and/or the 

average relative fluorescent intensity was determined by sorting greater than 10 3 

cells per experiment condition. 

Sequence Analysis of AAV Circular Intermediates 

Sequence analysis of the ITR array within circular intermediates was 
30 performed using primers ELI 18 (5 '-CGGGGGTCGTTGGGCGGTC A-3 ' ; SEQ 
ID NO:l) and EL230 (5 ' -GGGCGG AGCCTATGG AAAA-3 ' ; SEQ ID NO:2) 
which are nested to 5' and 3' ITR sequences, respectively. Both circular and 
linearized (with Smal which cuts within ITR sequences) plasmids were 
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sequenced. 
Results 

Construction of rAAV Shuttle Vector and N ation of Circular Tntgrrr^ rljatrs 

To circumvent the inability to retrieve pre-integration intermediates or as 
5 stable episomal forms resistant to nuclease digestion, an alternative strategy was 
' developed to "trap" circular intermediates using a recombinant AAV shuttle 
vector. Recombinant AV.GFP3ori virus (Figure IB) was generated from a 
m-acting plasmid (pCisAV.GFP3ori, Figure 1 A) by co-transfection in 293 cells 
with trans-acting plasmids encoding Rep and Cap viral genes. This viral vector 
1 0 (AV.GFP3ori) encoded the green fluorescent protein (GFP) reporter gene, a 
bacterial origin of replication (ori), and the bacterial ampicillin-resistance gene. 
Ori and ampicillin-resistance sequences encoded in this virus allow for the 
rescue of circular AAV genomes formed during the transduction process. 
To test this strategy, Hela cells were infected with AV.GFP3ori 
1 5 (MOI=l 000 particles/cell) and the abundance of circular intermediates was 
evaluated following transformation of low molecular weight cellular Hirt DNA 
into E. coli SURE cells. The presence of circular intermediates was inferred by 
retrievable ampicillin-resistant bacterial colonies. Structural features of circular 
intermediates were determined by restriction enzyme analysis and Southern 
20 blotting with various regions of the provirus, including GFP, Stuffer, and ITR 
sequences. 

The predominant circular form isolated after transduction of Hela cells 
with AV.GFP3ori consisted of 4.7 kb monomer-sized molecules (Figure 1C). 
SphI digestions of these circular intermediates yielded characteristic 300 bp 

25 bands which hybridized to an ITR probe on Southern blots (Figure 2A). PstI, 
SphI, Asel single and double digests together with Southern blot analysis using 
GFP, Stuffer (data not shown), and ITR (Figure 2A) probes confirmed the 
structure of the circular intermediates as head-to-tail monomer genomes (Figure 
1C). In particular, PstI digests together with ITR Southern blots distinguish 

30 these head-to-tail circular intermediates from head-to-head or tail-to-tail circular 
dimers. Similar results obtained with studies on AV.GFP3ori infected 293 cells 
and primary fibroblasts have confirmed that monomer head-to-tail circular 
intermediates were also the most abundant form in these cell types. 
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Because the predicted molecular weight of an intact head-to-tail ITR 
SphI fragment would be approximately 360 bp, an anomalous migration in 
agarose gels might be due to the high secondary structure of inverted repeats 
within ITRs. To this end, the head-to-tail orientation of the ITRs, as predicted 
5 by Southern blot analysis, was confirmed using several sequencing strategies. 
First, the SphI ITR hybridizing fragment of a circular intermediates was 
subcloned into a secondary plasmid vector and sequenced with primers outside 
the ITR cloned sequences. These findings confirmed the head-to-tail orientation 
of ITRs. Additionally, sequence was obtained directly from six monomer 

10 circular intermediate clones using primers internal to both the 5' and 3' ITRs 
(Figure 2C). In these studies, circular intermediates were digested with Smal 
and the linear 4.6 kb plasmid was gel isolated prior to sequencing. Smal 
digestion (which relaxed the secondary structure of ITRs) was necessary to 
obtain sequence information within the ITRs. The sequencing results presented 

1 5 in Figure 2C confirmed the orientation of head-to-tail ITR arrays in these 
intermediates. 

Interestingly, sequencing also revealed several consistent base pair (bp) 
changes in four of the six clones analyzed (Figure 2C). These four clones (p79, 
p81, p87, and p88) had consistent two bp changes within the D-sequence [G->A 
20 (122 bp) and A->G (125 bp)], which always occurred together with the bp 
alterations in the p5 promoter [A->G (1 14 bp) and A->C (1 15 bp)]. No other 
consistent bp changes were noted except for two clones (p79 and p88) which 
demonstrated mutations just outside the 3TTR D-sequence [T->G (381 bp) and 
T->C(383 bp)]. 

25 Although head-to-tail circular intermediates were the most abundant 

forms present in Hirt DNA from rAAV infected Hela cells, several less frequent 
structures were also detected. These included monomer circularized AAV 
genomes with one (pi 90) and three ITRs (p345) arranged in a head-to-tail 
fashion as well as several clones with an unknown structure lacking complete 

30 ITRs (p340) (Figure 2A). Such diversity within the ITR array may represent 
homologous recombination in vivo or in bacteria during amplification. However, 
previous studies demonstrating similar variations in ITR sequences of 
head-to-tail integrated genomes, suggest that such changes in the length of the 
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ITR array may occur in vivo (Duan et al., 1997) Additionally, less frequent 
head-to-tail circularized multimer forms were predicted based on the variation in 
migration patterns of uncut plasmids which gave identical restriction patterns. 
Results shown in Figure 2B confirmed the existence of monomer and dimer 
5 head-to-tail circular intermediates using partial digestion with an enzyme which 
cuts once in the AAV genome (Asel). Cumulative analysis of greater than 200 
independently isolated circular intermediates from Hela cells demonstrated that 
head-to-tail circular AAV genomes occurred in greatest abundance as monomers 
(92%) and less frequently as multimers of greater than one genome (8%). 
10 To establish that head-to-tail circular intermediates were formed in vivo 

and not by non-specific bacterial recombination of linear AAV genomes present 
in the Hirt DNA, a set of reconstitution experiments was performed by which the 
same number of rAAV particles used for infection experiments were spiked into 
Hela cell lysates prior to Hirt preparations. In these studies, background 
15 bacterial amplification of Hirt DNA spiked with linear rAAV genomes was 
negligible (Figure 3D) and of the few isolated colonies obtained from these 
controls, none had a predicted head-to-tail structure as assessed by Southern blot 
restriction enzyme analysis (Figure 3E). Additionally, reconstitution 
experiments transforming bacterial with linearized dsDNA AAV genomes did 
20 not give rise to significant levels of replication competent plasmids or the 

characteristic head-to-tail structure associated with AAV circular intermediates. 
These findings confirm that circular intermediates do not likely arise from 
non-specific recombination or ligation events with either ssDNA or dsDNA 
linear AAV genomes in bacteria. Additional control experiments, demonstrating 
25 the lack of stuffer hybridizing sequences in AAV circular intermediates by 
Southern blotting, also confirm that these structures do not arise from 
contamination of viral stocks with pCisAV.GFP3ori plasmid. 
The form ation of head-to-tail circular AAV intermediates is augmented hy 
superinfection with El -deleted adenovirus. 
30 Many aspects of the wtAAV growth cycle are affected by helper 

adenovirus, including AAV DNA replication, transcription, splicing, translation, 
and virion assembly. Such studies have provided concrete evidence that a subset 
of Ad early gene products provide helper functions for the wtAAV lytic cycle, 
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including: El a, Elb, E2a, E4 0RF6 and VA1 RNA (Muzyczka, 1992). In this 
regard, one of the most critical factors which is required for AAV replication is 
the 34 kD E4 protein (ORF6). Recent observations on the helper function of Ad 
in rAAV transduction have also demonstrated that Ad E4 ORF6 is essential for 
5 the augmentation of rAAV transgene expression seen with adenovirus 

co-infection (Ferrari et al, 1996; Fisher et al, 1996). According to these reports, 
the rate-limiting step enhanced by these adenoviral proteins is the conversion of 
single stranded AAV genomes to double stranded forms. 

Studies evaluating the kinetics of rAAV circular intermediate formation 
1 0 demonstrated a time-dependent increase in abundance which peaked at 24 hours 
post-infection in Hela cells and coincided with the onset of GFP transgene 
expression (Figure 3). To better understand the cellular mechanisms associated 
with AAV circular intermediate formation, the effects of adenoviral co-infection 
on this process were evaluated. The extent of transgene expression and circular 
1 5 intermediate formation in AV.GFP3ori infected Hela cells with or without 
co-infection with El -deleted recombinant adenovirus was compared. 

Although El -deleted adenoviruses are severely handicapped in their 
ability to synthesize viral gene products, at high MOIs of >5000 significant E2a 
protein expression was noted (Figure 3 A). As an indicator of transgene 
20 expression, the abundance and average relative intensity of GFP positive cells 
was determined against mock infected controls by fluorescent microscopy 
(Figure 3B) and FACS analysis (Figure 3C) at 72 hours post-infection. In accord 
with previous reports demonstrating augmentation in rAAV transgene expression 
by adenovirus (Ferrari et al., 1996; Fisher et aL, 1996), the extent of GFP 
25 transgene expression was dramatically increased at doses of adenovirus which 
led to viral gene expression (MOI>5000; Figures 3 A-C). Additionally, 
persistence of rAAV transgene expression was also augmented by co-infection 
with El -deleted adenovirus, as determined by GFP-expressing colony formation 
following serial passages (Figure 3C). 
30 If circular intermediates represent a molecular form of rAAV important 

for efficient and/or persistent transgene expression, augmentation of rAAV 
transgene expression by adenovirus might also modulate circular intermediate 
formation. In these studies, the abundance and time course of AAV circular 
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intermediate formation was evaluated following superinfection with 
Ad.CMVLacZ. Results from these experiments are shown in Figure 3D, which 
represents the total number of bacterial colonies (per 35 mm plate) obtained 
following transformation of E. coli with Hirt DNA isolated from Hela cells 
5 infected with AV.GFP3ori (1000 DNA particles/cell) with or without 

co-infection with Ad.CMVlacZ (5,000 particles/cell). An MOI of 5000 Ad 
parti cles/cell was chosen for these experiments since this level of adenovirus led 
to minimal cytopathic effect (CPE) with high levels of E2a expression. 
These studies demonstrated a nearly 2-fold augmentation by 
10 Ad.CMVLacZ in the total abundance of AAV rescued plasmid intermediates in 
E. coli (Figure 3D). Southern blot restriction enzyme analysis demonstrated that 
the predominant forms in both the presence and absence of adenovirus were 
head-to-tail monomer circular intermediates containing the diagnostic 300 bp 
ITR fragment following SphI digestion (Figure 3E). Additionally, results 
15 demonstrated that adenovirus co-infection led to an earlier time of onset and 
increased stability of AAV head-to-tail monomer circular intermediates (Figures 
3E and F). For example, at 6 hours post-infection, head-to-tail circular 
intermediates were only present in Hela cells co-infected with adenovirus. 
Furthermore, a decline in the percentage of head-to-tail circular intermediate 
20 clones was seen at 48-72 hours post- AAV infection in the absence of adenovirus. 
In contrast, this decline was significantly blunted by the presence of helper 
adenovirus (Figure 3F). Based on these findings, it was concluded that certain 
adenoviral proteins produced by superinfection with El-deleted adenovirus were 
capable of modulating circular intermediates formation and stability during 
25 rAAV transduction. 
Discussion 

In the present study, it was shown that circularization of linear AAV 
genomes occurs during rAAV transduction. Circularization appears to 
predominately occur as head-to-tail monomer genomes. However, the existence 
30 of less abundant circular multimer forms suggests that recombinational events 
subsequent to the initial infection may drive concatamerization of circular 
genomes. The diversity in the length of ITR arrays found within circular 
intermediates (i.e., 1-3 ITRs) also supports the notion that these forms may be 
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highly recombinagenic. Of mechanistic interest in the formation of circular 
intermediates is the uniformity of mutations observed in the D-sequences and 
nearby p5 promoter region and the confinement of these mutations to the 
5'-ITRs. Although the etiology of these base pair changes is unknown, their 
5 uniformity suggests that they may have a direct role in the formation of circular 
intermediates and in increased stability. Recent findings, which suggest that an 
endogenous host single strand D-sequence binding protein is important in rAAV 
transduction, lend support to the potential involvement of this sequence in 
circular intermediate formation (Wang et al., 1997; Qing et al., 1998). 
10 Furthermore, it remains to be determined whether the in vivo formation of AAV 
circular intermediates occurs through the circularization of single or double 
stranded AAV genomes. 

By analogy, retroviral transduction intermediates have striking 
similarities to the cunrent findings with AAV. Three DNA forms have been 
1 5 isolated following retroviral infection, including linear DNA with long terminal 
repeats (LTRs) at both ends, circular DNA with one LTR, and circular DNA 
with multiple LTRs (Panganiban, 1985). Although it is disputed which of these 
forms are the direct precursor to integration, the existence of circular retroviral 
genomes which also have similar repeat regions at the ends of their genomes 
20 suggests the potential for common mechanisms with the formation of AAV 
circular intermediates. These AAV circular intermediates could act as 
integration precursors and/or stable episomal genomes. 

The head-to-tail ITR structures found in AAV circular intermediates are 
most characteristic of latent integrated AAV genomes. In contrast, lytic phases 
25 of AAV growth are typically associated with head-to-head and tail-to-tail 

replication form genomes. Hence, it is likely that circular intermediates represent 
a latent aspect of the AAV life cycle. The finding that co-infection with 
adenovirus leads to increased abundance and stability of AAV circular 
intermediates suggests a novel link between adenoviral helper functions and 
30 latent infection of AAV. 

Aspects of inverted head-to-tail ITRs, which include palindromic 
hairpins similar in structure to "Holliday-like" junctions, might impart 
recombinagenic activity which aids in viral integration. Such Holliday junctions 
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have been shown to play critical roles in directing homologous recombination in 
bacteria through the processing of recombination intermediates by RuvABC 
proteins (West, 1997; Lee et al., 1998). Interestingly, a mammalian 
endonuclease, analogous to bacterial RuvC resolvase, has also been isolated 
5 from cell lines (Hyde et al., 1 994). Despite the theoretical considerations which 
might suggest that circular AAV genomes have characteristics of preintegration 
intermediates, a study with recombinant retrovirus has demonstrated that 
palindromic LTR-LTR junctions of MMLV are not efficient substrates for 
proviral integration (Lobel et al., 1989). Nonetheless, circular AAV genomes 
10 have been previously proposed as integration intermediates based on proviral 
structure (Linden et al., 1996). 



Rvamplf! 7 

Methods 

15 Production of rAAV ShnfH* yp r ^ r 

The c/j-acting plasmid (pCisAV.GFP3ori) used for rAAV production 
was generated by subcloning the Bspl201/Not I fragment (743 bp) of the GFP 
transgene from pEGFP-1 (Clontech) between the CMV enhancer/promoter and 
SV40polyA by blunt-end ligation. A 2.5 kb cassette containing beta-lactamase 
20 and bacterial replication origin from P UC19 was blunt ligated down-stream of 
GFP reporter cassette. The ITR elements were derived from pSub201 .2 The 
entire plasmid contains a 4.7 kb AAV component flanked by a 2 kb stuffer 
sequence. The integrity of ITR sequences was confirmed by restriction analysis 
with Smal and PvuII, and by direct sequencing using a modified di-deoxy 
25 procedure which allowed for complete sequence through both 5 ' and 3 ' ITRs. 
Recombinant AAV stocks were generated by co-transfection of 
pCisAV.GFP3ori and pRep/Cap together with co-infection of recombinant 
Ad.CMVlacZ in 293 cells. The rAV.GFP3ori virus was subsequently purified 
through 3 rounds of CsCl banding as described in Duan et al., 1997. The typical 
30 yields from this viral preparation were 1012 DNAmolecules/ml. 

DNA titers were determined by viral DNA slot blot hybridization against 
GFP 32 P-labeled probe with copy number plasmid standards. The absence of 
helper adenovirus was confirmed by histochemical staining of rAAV infected 
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293 cells for beta-galactosidase, and no recombinant adenovirus was found in 
10 10 particles of purified rAAV stocks. The absence of significant wtAAV 
contamination was confirmed by immunocytochemical staining of rAAV/Ad 
co-infected 293 cells with anti-Rep antibodies. Transfection with pRep/Cap was 
5 used to confirm the specificity of immunocytochemical staining. No 

immunoreactive Rep staining was observed in 293 cells infected with 10 10 rAAV 
particles. 

Isolation of AAV Circular Intermediat es From MiisrJft , 

The tibialis anterior muscle of 4-5 week old C57BL/6 mice were infected 
10 with AV.GFP3ori (3 X 10 10 particles) in Hepes buffered saline (30 GFP 
expression was analyzed by direct immunofluorescence of freshly excised tissues 
and/or in formalin-fixed cryopreserved tissue sections in four independently 
injected muscles harvested at 0, 5, 10, 16, 22 and 80 days post-infection. Tissue 
sections were counter-stained with propidium iodide to identify nuclear DNA. 
15 Hirt DNA (Hirt, 1967) (20 ml per muscle sample) was isolated from at least 
three independent muscle specimen for each time point and used to transform E, 
coli SURE cells using 3 ml of Hirt with 40 ml of electrocompetent bacterial 
(approximately 1 x 10 9 cfii/ug DNA, Strategene Inc.). The resultant total number 
of bacterial colonies was quantified for each time point and the abundance of 

20 head-to-tail circular intermediates was evaluated for each time point (> 20 
bacterial clones analyzed) by PstI, Asel, SphI, and Pstl/Asel digestion, and 
confirmed by Southern blot analysis using ITR, GFP and stuffer probes. The 
head-to-tail configuration in typical clones were also confirmed by dideoxy 
sequencing using primers ELI 18 (5 '-CGGGGGTCGTTGGGCGGTC A-3 ' ; SEQ 

25 ID NO:l) and EL230 (5 ' -GGGCGGAGCCTATGGAAAA-3 ' ; SEQ ID NO:2) 
which are nested to 5' and 3' ITR sequences, respectively. Zero hour controls 
were generated by mixing 3 x 10 10 particles of AV.GFP3ori with control 
uninfected muscle lysates prior to Hirt DNA preparation. As described in Table 
1 , a number of additional controls for were performed to rule out non-specific 

30 recombination of linear AAV genomes in bacteria as a source for isolated 
circular intermediates. 
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Table 1. Control Experiments for Rescue of Circular Intermediates in Bacteria 





Type of Input 
DNA 


Source of DNA 


Number of 
Molecules 


Number of 
Amp Resistant 
Bacterial 
Colonies 


Presence of 
Head-to-Tail 

Circular 
Intermediates 

e 


5 


Purified rAW 


Hirt from Infected 
Muscle (22 day) 


3 x 10 10 


approximately 
5 x 10 3 


Yes 




Purified rAAV 


Virus 
reconstituted into 
Uninfected 
Muscle Hirt a 


3x 10 10 


0 


No 


10 


Linear ssDNA 
Encompassing 
rAAV 
Genome b 


Isolated from 
Purified Virus 


3xlO to 


2 


No 




Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 


Isolated from 
proviral plasmid 
(Hindm/PvuII) c 


3.x 10 10 


3 


No 


15 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 
+ ligase d 


Isolated from 
proviral plasmid 
(Hindlll/PvuII) 


3 x 10 10 


>6x 10 3 


Yes 



20. 



Purified virus was reconstituted into muscle homogenates prior to preparation 
of Hirt DNA. 



b Viral DNA predominantly contained single stranded genomes as evident by 
25 Southern blot analysis against with ITR probe. However, small amount of 
dsDNA AAV genomes also existed and are likely due to reannealing of single 
stranded genomes during preparation. Purified viral DNA concentrations were 
determined by OD 260 and 75 ng representing approximately 3 x 10 10 viral 
genomes were used for transformation of bacteria. 

30 

c Hindm/PvuII digestion was used to remove the entire rAAV genome from 
pcisAV.GFP3ori. Hindlll and PvuII leave 10 and 0 bps of flanking sequence 
outside the 5' and 3' ITRs, respectively. The linear dsDNA fragment (4.7 kb) 
was gel isolated following blunting with T4 DNA polymerase and the DNA 
35 concentration determined by OD 260 . One hundred and fifty ng of linear fragment 
representing approximately 3 x 10 i0 viral genomes were used for transformation 
of bacteria. 
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d Linear dsDNA viral genomes (Hindm/PvuII blunted fragment) were treated 
with T4 DNA ligase prior to transformation of bacteria. 

e The presence of head-to- tail circular AAV intermediates were confirmed by 
5 restriction enzyme digestion (Asel, PstI, and SphI) and Southern blotting against 
ITR probe. 

Fractionation of muscle Ri rt DNA p re parations 

Preparative-scale fractionation of the muscle Hirt DNA was performed 
10 by 1 % agarose gel electrophoresis using the Bio-Rad Mini Prep Cell (Catalog 
#170-2908). A 4.5 ml (10.5 cm) tubular gel containing 1 x TBE buffer was 
poured according to manufacturer's specification. A total of 20 ml Hirt 
preparation from one entire muscle sample was loaded on top of the gel. 
Electrophoresis was carried out at a constant current of 10 mA over a period of 
1 5 5 hours. Sample eluent was drawn from the preparative gel apparatus by a 
peristaltic pump at a rate of 100 ml/min and eluted into a fraction collector at 
250 ml/fraction. The collected DNA was subsequently concentrated by standard 
ethanol precipitation and used to transform SURE bacterial cells by 
electroporation as described above. 
20 In vitro Persistence of A A V rimnlar Int ermediates 

Transgene expression and persistence of AAV circular intermediate 
plasmid clones were evaluated following transient transfection in Hela and 293 
cells. Subconfluent monolayers of Hela cells in 24-well dishes were transfected 
with 0.5 mg of either AAV circular intermediates (p81 or p87) or pCMVGFP 
25 using Lipofectamine (Gibco BRL Inc.). The cultures were then incubated for 5 
hours in serum free DMEM followed by incubation in DMEM supplemented 
with 10% fetal bovine serum. All plasmid DNA samples used for transfections 
were spiked with pRSVlacZ (0.5 mg) as an internal control for transfection 
efficiency. At 48 hours post-transfection, cells were passaged at a 1:10 dilution 
30 and allowed to grow to confluency (day 5), at which time GFP clones were 
quantified for size and abundance using direct fluorescent microscopy. The 
percent of beta-galactosidase-expressing cells was also quantified at this time 
point by X-gal staining. At 5 days, cells were passaged an additional time (1:15 
dilution) GFP clones were quantified again at day 10. The persistence of 
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plasmid DNA at passage-5, 7, and 10 days post-transfection was evaluated by 
Southern blot analysis of total cellular DNA using 32 P-labeled GFP probes. To 
determine whether the head-to-tail ITR array within circular intermediates was 
responsible for increases in the persistence of GFP expression, the head-to-tail 
5 ITR DNA element was subcloned into the pGL3 luciferase plasmid to generate 
pGL3(ITR). The head-to-tail ITR DNA element was isolated from a monomer 
circular intermediate (p81).by AatH and Haell double digestion and 
subsequently inserted into the Sail site of pGL3 (Promega) by blunt ligation. 
The resultant plasmid pGL3(ITR) contains the luciferase reporter and 
1 0 head-to-tai 1 ITR element 3 ' to the polyA site. The integrity of the ITR DNA 
element within this plasmid was confirmed by sequencing. The persistence of 
transgene expression from pGL3(ITR) was compared to that of pGL3 by 
luciferase assays on transiently transfected Hela cells as described above and 
analyzed at 10 days (passage-2). Transfection efficiencies were normalized 
15 using a dual renilla luciferase reporter vector (pRLS V40, Promega). 
Results 

AAV Circul a r Intermediates Kenresent Stahl* R P i«n ma l Fnrmc »f y ) n ,| p NA 
Associated with T,on"-term Persistence of Tranc e F.ypm^nn j n Mmdr 

To evaluate the molecular characteristics of rAAV genomes in muscle, a 
20 rAAV shuttle viral vector (AV.GFP3ori) was utilized which harbors an 

ampicillin resistance gene, bacterial origin of replication, and GFP reporter gene 
(Figure 1A). This recombinant virus was used to evaluate the presence of 
circular intermediates by bacterial rescue of replication competent plasmids. In 
these studies, delivery of AV.GFP3ori (3 x 1010 particles) to the tibialis muscle 
25 of mice led to GFP transgene expression which peaked at 22 days and remained 
stable for at least 80 days (Figure 4A). These results confirmed previous 
successes in rAAV mediated gene transfer to muscle (Kessler et al., 1996; 
Herzog et al., 1997; Xiao et al., 1996; Clark et al., 1997; Fisher et al., 1997). 
The formation of circular intermediates was evaluated by E. coli transformation 
30 of Hirt DNA harvested from muscle at 0, 5, 10, 16, 22, and 80 days 
post-infection with AV.GFP3ori. 

In these muscle samples, circular intermediates were found to have a 
characteristic head-to-tail structure with 1-2 ITR repeats. The most abundant 
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form included two inverted ITRs within a circularized genome (Figure 4B, clone 
pi 7). This figure also depicts a less frequent form (< 5%) of circular 
intermediates observed, p439, with undetermined structure. When this type of 
replication competent plasmid was seen, it was not included in the 
5 quantification of head-to-tail circular intermediates since its structure could not 
be conclusively determined. The total abundance of muscle Hirt derived 
head-to-tail circular intermediates (with 1-2 ITRs) demonstrated a 
time-dependent increase that peaked with transgene expression at 22 days and 
slightly decreased by day 80 (Figure 5A). Increased diversity in the length of 
10 ITR arrays within circular intermediates was seen at longer time points. For 

example, Figure 5B demonstrates several isolated circular intermediates with 1-3 
ITRs isolated from 80 days muscle Hirt samples. This is in contrast to the more 
uniform structure of circular intermediates with two ITRs in a head-to-tail 
conformation at 5-22 days post-infection. 
15 To evaluate the potential for artifactual rescue of linear rAAV genomes 

by recombination in bacteria, several control experiments were performed. First, 
uninfected control muscle Hirt preparations, spiked with an equal amount of 
rAAV virus used for in vivo infection of muscles, failed to give rise to 
replicating plasmids following transformation of E.coli. Second, when a blunted 
20 linear double stranded HindHI/PvuII fragment isolated from pcisAV.GFP3ori 
(encompassing the entire rAAV genome) was used to transform bacteria, no 
ampicillin resistant bacterial colonies were obtained. The addition of T4 ligase 
to this fragment, however, led to significant numbers of bacterial colonies. 
Third, when purified single stranded rAAV DNA was used for transformation, 
25 no bacterial colonies were obtained. As summarized in Table 1, these results 
confirm that in the absence of productive infection, rAAV genomes themselves 
are incapable of recombining into replication competent plasmids in bacteria. 
Hence, in vivo circularization of rAAV genomes is a prerequisite for rescuing 
autonomously replicating plasmids in E. coli with this shuttle vector. 
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Molecular weight of circular intermediates suggest a conversion from monomer 
to multimer for ms over time* 

To further characterize the circular intermediates isolated from muscles, 
Hirt samples from 22 days and 80 days post-infected muscles were size 
5 fractionated by continuous-flow gel electrophoresis (BioRad). As shown in 
Figure 6 , the majority of circular intermediates at 22 days post-infection size 
fractionated at a molecular weight of less than 3 Kbp. Very few clones were 
isolated from fractions between 3 to 5 kb and no clones were obtained from 
fractions larger than 5 kb at this time point. Furthermore, this size fractionated 
10 molecular weight of in vivo Hirt derived circular intermediates at 22 day time 
points correlated with that of head-to-tail monomer undigested circular 
intermediate plasmids rescued in bacteria from this same time point 
(approximately 2.5 kb). These data suggest that at early time points 
post-infection in muscle, the predominant form of circular intermediates likely 
15 occurs as monomer genomes. The lower mobility of this fraction as compared to 
replication form monomer (Rfm=4.7 kb) and dimer (Rfd=9.4 kb) genomes 
provides indirect evidence that these forms are not responsible for rescued 
plasmids in these Hirt samples. Interestingly, when 80 day muscle Hirt samples 
were size fractionated, more clones were retrieved from higher molecular weight 
20 fractions ranging from 3-12 kb (Figure 6). This shift in the molecular weight of 
circular intermediates indicates the potential for recombination between 
monomer forms in the generation of large circular multimer genomes. Such 
concatamerization has been previously observed in muscle and has traditionally 
been hypothesized to involve linear integrated forms of the AAV genome 
25 (Herzog et al, 1997; Xiao et al., 1996; Clark et al, 1997; Fisher et al., 1997). 
This data sheds new light on the molecular characteristics of these persistent 
AAV genomes and suggests that they are in fact circular and episomal. Based on 
yields of retrievable circular plasmids reconstituted in Hirt DNA, the efficiency 
of bacterial transformation, and the initial innoculum of virus, we estimate that 
30 approximately 1 in 400 viral DNA particles circularize following infection in 
muscle (Table 2). 
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Table 2. Yield of Circular Intermediate Isolation from Hirt DNA 



Bacterial 
Transformation 


Starting Number 
of Plasmid or 
AAV Genomes 


Actual Number 
of Amp'cfu 


Adjusted 
Yield 


Hirt DNA from rAAV 
Infected Muscle a 


3 x 10'° molecules 


5 x 10 3 cfu 


5x 10 5 cfu e 


Hirt DNA + 230 ng 
LacZ Plasmid bc 


3 x 10 10 molecules 


2x 10 4 cfu d 


2x 10 8 cfu 


230 ng LacZ Plasmid c 


3x 10'° molecules 


2 x 10* cfu 





10 



15 



The actual amount of Hirt used for transformation was 3/20 the entire Hirt 
DNA. The numbers have been adjusted to reflect viral innoculum and yields for 



the entire muscle. 



b Plasmid DNA was spiked into mock infected muscle homogenates prior to 
isolation of Hirt DNA. This reconstituted Hirt DNA was then used for 
transformation of bacteria. 



20 



25 



30 



c The actual microgram amounts of plasmid used in reconstitution experiments 
was 10 ng. The numbers have been adjusted for comparison to normalize the 
number of plasmids genomes to that used in AAV experiments. Control LacZ 
plasmid was approximately 7000 bp with a molecular weight of 4.6 x 10 6 
g/mole. 

d The average of several experiments indicates an approximate 100-fold 
reduction in the number of cfu recovered from bacterial transformations with 
DNA isolated from Hirt extract spiked with plasmids as compared to 
transformation with an equivalent amount of plasmid DNA alone. 



Adjusted yield indicate approximately 1 in 400 AAV genomes circularize 



VIVO. 



in 



Given the fact that not all rAAV particles likely contain functional DNA 
35 molecules and intermediates may integrate, these calculations may represent 
underestimation. 



an 



40 



AAV Circular Intermpdi a tes Demonstrate lnc.rr. w A Persktpnre as PlasmiH 
Based Vectors 

Based on the finding that circular AAV intermediates were associated 
with long term persistence of transgene expression in muscle, rAAV circular 
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head-to-tail intermediates may be molecular structures of the AAV genome 
associated with the latent life cycle and increased episomal stability. Several 
aspects of the structure of AAV circular intermediates may account for their 
increased stability in vivo. First, circularization of AAV genomes may create a 
5 nuclease resistant conformation. Secondly, since the only viral sequences 
contained within circular intermediates are the head-to-tail ITR array, these 
sequences might bind cellular factors capable of stabilizing these structures in 
vivo. Several studies have demonstrated increased persistence of transgene 
expression with plasmid DNA encoding viral ITRs (Philip et al., 1994; Vieweg 
10 et al., 1995). The results described above provide a functional explanation for 
the increased persistence through the association with circular intermediate 
formation as part of the AAV life cycle. 

To more closely evaluate the persistence of AAV head-to-tail circular 
intermediates, several in vitro experiments were performed by transfecting these 
1 5 intermediates into Hela cells and assessing the stability of plasmid DNA and 
transgene expression by GFP clonal expansion. Results from Hela cell 
transfection experiments demonstrated that two monomer head-to-tail circular 
intermediates (p8 1 and p87) studied gave rise to a 1 0-fold higher number of five 
and ten day transgene-expressing clones, as compared to a control pCMVGFP 
20 plasmid lacking the ITR sequences (Figures 7A and B). Additionally, the size of 
GFP positive colonies at 5 days post-transfection was three-fold larger in Hela 
cells transfected with p81 and p87, as compared to the pCMVGFP control vector 
(Figures 7A and B). These studies suggest the AAV circular intermediates have 
increased stability of transgene expression and substantiate findings in muscle. 
25 To confirm the increased molecular persistence of head-to-tail circular 

intermediates following transfection into Hela cells, total DNA (low and high 
molecular weight) was isolated from cultures of pCMVGFP and p81 transfected 
Hela cells at various passages post-transfection and analyzed by Southern 
blotting. Southern blots hybridized to 32 P-labeled GFP probes demonstrated a 
30 significantly higher level of p8 1 plasmid DNA at passage-7 as compared to the 
control vector lacking the head-to-tail ITR sequence (Figure 7C). The majority 
of signal in undigested DNA samples was associated with a 4.7 kb band 
migrating at the approximate size of the uncut monomer plasmids. Together 
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with the fact that the majority of signal from all cell cultures in Figure 7C 
disappeared by passage- 10, these data suggest that these plasmids predominantly 
remained episomal. Thus, in both muscle and Hela cells, increased persistence 
of AAV circular intermediates is correlated with stable transgene expression. 
5 ITR arrays are responsible for increased p ersistence 

To investigate whether the head- to-tail ITR DNA element was 
responsible for the increased persistence of circular intermediates, we cloned this 
DNA element into a secondary luciferase vector (pGL3) to give rise to 
pGL3(ITR). Transient transfection experiments in Hela cells demonstrated a 

1 0 five-fold increase in the persistence of luciferase expression in serially-passaged 
cultures at 10 days in pGL3(ITR) as compared to that of pGL3 transfected 
(Figure 7D). These findings support the hypothesis that the head-to-tail ITR 
DNA element contained within circular intermediates is responsible for 
mediating the increased persistence of transgene expression and suggest a 

1 5 mechanism by which these molecular intermediates may confer stability to AAV 
genomes in vivo. Furthermore, increases in the stability of transgene expression 
conferred by this element appear to be primarily context independent, since the 
head-to-tail ITR element was 3' to the luciferase gene in pGL3(ITR) and 5' to 
the GFP transgene in AAV circular intermediates. 
20 Discussion 

Characterization of integrated proviral structures in different cell lines 
has demonstrated head-to-tail genomes as the predominant structural forms for 
both wild type and recombinant AAV (McLaughlin et al., 1988; Cheung et al., 
1980; Duan et al, 1997). This is in contrast to the head-to-head and tail-to-tail 

25 structures observed in AAV replication intermediates (Rim and Rfd). Both Rfm 
and Rfd configurations have also been demonstrated in rAAV infected cells and 
enhanced conversion of ssAAV genomes to double stranded Rfrn and Rfd forms 
has been suggested as a mechanism for augmentation of rAAV transduction by 
adenovirus in cell lines (Ferrari et al., 1996; Fisher et al., 1996). However, it is 

30 plausible that the mechanisms responsible for the formation of Rfm and Rfd 
molecules are different from pathways which lead to long-term transgene 
expression. In support of this hypothesis is a recent study evaluating 
augmentation of rAAV transgene expression by adenovirus in liver (Snyder et 
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al., 1997). These studies have demonstrated that co-infection of the liver with 
adenovirus and rAAV enhances short term transgene expression while long term 
expression was no different than rAAV alone. The exact mechanism for the 
formation of head-to-tail circular intermediates is not clear, however similar 
5 structures have been demonstrated to act as pre-integration intermediates for 
retrovirus (Varmus, 1982). In this regard, circularized retroviral genomes with 
one and two viral LTRs have been proposed. In addition, circular pre-integration 
intermediates have also been suggested by recent studies on wtAAV integration 
(Linden et al., 1 996b). The demonstration that circular intermediates exist in 
1 0 rAAV infected muscle explains several features of latent phase infection with 
rAAV vectors including proviral structure and stable episomal persistence. 

Previous studies have suggested that rAAV genomes delivered to muscle 
might persist as head-to-tail concatamers (Herzog et al., 1997; Clark et al., 1997; 
Fisher et al., 1997). However, it is currently unknown whether these 
1 5 concatamers exist as free episomes or as integrated proviruses in the host 

genome. The results described above, i.e., demonstrating prolonged persistence 
of head-to-tail circular intermediates at 80 days post-infection, suggest that a 
large percentage of rAAV genomes may remain episomal. The conversion of 
monomer circularized genomes to larger circularized multimers appears to be an 
20 aspect associated with long term persistence and likely represents 

recombinational events between monomer intermediates. Although the bacterial 
rescue strategy was not capable of satisfactorily addressing the size of multimers, 
our modified approach to size fractionating Hirt DNA prior to bacterial rescue of 
intermediates lends support to this hypothesis. Additional supportive evidence 
25 for increased recombination over time is the finding that greater variability in the 
length of ITR arrays was observed at longer time points post-infection. For 
example, at 5-22 days the majority of circular intermediates contained 2 ITRs in 
a head-to-tail fashion. This is in contrast to 80 day time points where the lengths 
of ITR arrays ranges from 1-3 ITRs. Such diversity of ITR arrays in muscle 
30 infected with AAV has been previously found using PCR approaches (Herzog et 
al., 1997; Fisher et al, 1997). In addition, the 30% decline in the abundance of 
circular intermediates in muscle between 22 and 80 days also supports a 
hypothesis that these molecular forms of AAV may represent pre-integration 
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complexes. 

Given the fact that circular intermediates had long term persistence in 
muscle, certain structural features of these intermediates may affect episomal 
stability of DNA. Previous studies have noted increased persistence of transgene 
5 expression from plasmids encoding AAV ITRs (Philip et al., 1994; Vieweg et 
al, 1995). However, the physiologic significance of this finding has remained 
elusive. The present study, demonstrating the head-to-tail ITR arrays isolated 
from AAV circular intermediates can confer increased episomal persistence to 
plasmids following transfection in cell lines, gives a mechanistic framework for 
10 ITR effects on plasmid persistence. Furthermore, the correlation that AAV 
circular intermediates have increased persistence in cell lines in vitro, lends 
support to the hypothesis that these structures represent stable episomal forms 
following rAAV transduction in muscle. Stability of circular intermediates in 
vivo might be mediated by the binding of cellular factors to "Holliday-like" 
1 5 junctions in ITR arrays which stabilize or protect DNA from degradation. 

rAAV has been shown to be an efficient vector for expressing transgenes 
in various tissues in addition to muscle, such as brain, retina, liver, lung, and 
hematopoetic cells (Snyder et al., 1997; Muzyczka, 1992; Kaplitt et al., 1994; 
Walsh et al, 1994; Halbert et al, 1997; Koeberl et al., 1997; Conrad et al, 1996; 
20 Bennett et al, 1997; Flannery et al, 1997). Despite these advances in the 

application of rAAV, the mechanisms of in vivo rAAV-mediated transduction 
and persistence of transgene expression still remain unclear. Such questions as 
to the molecular state of rAAV following in vivo delivery is highly relevant to 
the clinical application of this viral vector. For example, should rAAV primarily 
25 persist as an randomly integrated provirus, the potential for insertional 

mutagenesis could present a major theoretical obstacle in the use of this vector 
due to the potential for mutational oncogenesis. The demonstration that rAAV 
can persist as episomes suggest that random integration and associated risks of 
malignancy may not be a major concern for this viral vector system. 
30 Additionally, the molecular determinants of AAV circular intermediates 

associated with increased persistence in cell lines appear to be contained within 
the DNA elements encompassing the inverted ITRs. The isolation of this 
naturally occurring viral DNA element, which forms as part of the AAV life 
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cycle and acts to stabilize circular episomal DNA, may prove useful in 
increasing the efficacy of both viral and non-viral gene therapy vectors. 

ExampkJ 

5 Evidence for Increased Rnisomal Persistence of A a v rir C ,ii fl r w.rmgHi^ j n a 
Mo del for in utero Plasmid-Based Gen e Therap y 

Persistence of AAV circular intermediates were assessed by injection of 
plasmid DNA directly into the pronucleus of fertilized Xenopus oocytes. 
Twenty-five ng of the p81 isolate of AAV circular intermediates was injected at 
1 0 the single cell stage of fertilized Xenopus oocytes. This plasmid was compared 
to the proviral plasmid pCisAV.GFP3ori, which contains two ITRs separated by 
stuffer sequence in an alternative confirmation to ITRs in p81. Figure 13 depicts 
the persistence of GFP plasmids as assessed by direct fluorescence of GFP. At 
this state of tadpole development, the fertilized oocyte has expanded from a 
1 5 single cell to approximately 1 0 6 cells. 

These studies confirm that AAV circular intermediates (p81) confer a 
higher level of stability in development Xenopus oocytes than plasmids 
containing similar transcriptional elements and ITR sequences in an alternative 
confirmation. Given that in the case of p81 injected oocytes, tadpoles are 
20 completely fluorescent, the data suggests that some level of integration may have 
occurred. 

Example 4 

Liposome Mediated Transfer o f Vectors of the Invention to t he Airway anH 

25 Muscle 

Studies evaluating the mechanisms of recombinant adeno-associated 
virus (AAV) transduction have identified a novel molecular intermediate 
responsible for episomal persistence. This intermediate is characterized by a 
circularized AAV genome with head-to-tail ITR repeats. Circular intermediates 
30 of rAAV were identified using a recombinant shuttle vector capable of 

propagating circularized viral genomes in bacteria. Pivotal experiments in cell 
lines demonstrate that the formation and persistence of these circular 
intermediates are augmented in the presence of helper adenovirus. These 
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findings suggest that cellular factors induced by adenoviral gene expression may 
modulate both the formation and/or persistence of AAV circular intermediates. 
Furthermore, studies in muscle have demonstrated that following rAAV 
infection, the formation and persistence of AAV circular intermediates correlates 
5 with the onset and maintenance (at 80 days) of transgene expression, 
respectively. Moreover, a 300 bp fragment encompassing the head-to-tail 
inverted ITR repeats found in AAV circular intermediates when cloned into 
heterologous expression plasmids can confer increased stability to those 
plasmids in HeLa cells. The structural aspects of AAV circular intermediates 
10 may lead to development of non-viral, plasmid based, gene transfer vectors with 
increased persistence of transgene expression. 

To determine whether AAV circular intermediates which differ in length 
and/or sequence of the ITR array are more efficacious plasmid based vectors for 
liposome-mediated gene transfer to the airway and muscle, several distinct forms 
1 5 of AAV circular intermediates are evaluated as plasmid-based delivery systems 
in three model systems of the airway including: 1) in vitro polarized primary 
airway epithelial monolayers, 2) mouse lung, and 3) human bronchial 
xenografts. Persistence is evaluated at both the level of transgene expression 
(using GFP and luciferase reporters) and at the level of episomal and integrated 
20 transgene derived DNA. Studies are performed to assess whether integration can 
be specifically enhanced by co-transfection with Rep DNA or mRNA. These 
studies also evaluate both the extent of integration and site specificity to AVS1 
sites in chromosome 19 of human model systems. 

Gene therapy using plasmid-based delivery systems have encountered 
25 several obstacles to efficient transgene expression. These obstacles include 
transient expression of transgenes and rapid degradation of DNA. In contrast, 
viruses have developed efficient mechanisms for transducing cells and 
expressing encoded viral genes. The molecular characteristics of AAV circular 
intermediates which confer increased persistence of transgene expression include 
30 a DNA element encompassing the head-to-tail ITR. Based on the findings that 
circular intermediates have increased episomal persistence in muscle following 
rAAV transduction, these structures may also have increased persistence as 
plasmid-based vehicles to the airway. Interestingly, several naturally occurring 
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mutations which are found in approximately 50% of AAV circular intermediates 
affect the stability of the intermediate. 

Several findings evaluating the efficiency of AAV circular intermediate 
formation from recombinant viral vectors have suggested that these structures 
5 are augmented in abundance by the presence of the E2a adenoviral gene product. 
These molecular structures may represent preintegration intermediates which, in 
the case of wild-type AAV, would efficiently integrate into the cellular genome 
by Rep facilitated mechanisms. However, in the case of recombinant AAV 
genomes (in the absence of Rep proteins), evidence suggests that these structures 
10 have increased episomal stability. To test whether exogenous addition of Rep 
and/or E2a can increase the efficacy of AAV circular intermediates by 
modulating their stability and/or integration, co-transfection methods with Rep 
encoding plasmids and mRNA are conducted. Additionally, exogenously 
supplied E2a DNA binding protein (DBP) may also enhance stability of AAV 
15 circular intermediates. Rep may increase the integration of circular 
intermediates while E2a may increase their episomal stability. Several 
observations including the association of E2a DBP with AAV genomes in the 
nucleus support a direct interaction between DBP and AAV circular 
intermediates. Furthermore, if DBP associates with AAV circular intermediates, 
20 its encoded nuclear localization sequence (NLS) may enhance nuclear 

sequestration of these plasmids in the nucleus. Alternatively, E2a may act to 
alter the persistence of AAV circular intermediates through the induction of 
cellular factors which interact with the ITR array. 

Liposome mediated gene transfer to the airway has considerable 
25 advantages due to the low level of toxicity. However, limitations include 
transient low level expression in differentiated airway epithelia. Despite this 
apparent limitation, several laboratories have had considerable success with the 
use of cationic liposome-mediated gene transfer in several animal models 
including mouse and rat lung, and numerous laboratories have pursued clinical 
30 trials, which suggested that these vehicles may show promise for gene therapy of 
the cystic fibrosis (CF) lung. Thus, delivery of the present vectors in plasmid 
form via liposomes may be a safe and effective vehicle for gene transfer to the 
airway. 
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To assess whether AAV circular intermediates may also have increased 
persistence in airway epithelial cells as seen in Hela cells, several distinct forms 
of circular intermediates delivered by liposome-mediated transfection into 
primary airway epithelial cells, are evaluated. Based on the diversity of ITR 
5 repeat elements between various isolated circular intermediates (i.e., including 0, 
1, 2, and 3 ITRs), circular intermediates isolated from later time points in muscle 
may have been naturally selected for increased stability in vivo. Hence, the 
structural consistencies between AAV circular intermediates are identified which 
give increased persistence as plasmid based vectors for gene transfer. 
10 Circular intermediates containing the GFP reporter gene and I, 2, and 3 

ITRs are transfected into primary airway cultures and polarized epithelial cell 
monolayers using the cationic lipid GL-67 (Genzyme Inc.). DNA to lipid ratios 
are optimized using a luciferase reporter. Additionally, the addition of EGTA, or 
the use of calcium-free media, can increase the extent of gene transfer about 10- 
15 fold, and may be included to enhance gene transfer to polarized epithelial * 
monolayers. To evaluate persistence and expression of transgenes from circular 
intermediates, direct fluorescent microscopy and Southern blotting of both Hirt 
and genomic DNA with GFP P 32 -labeled probes are utilized. Proliferating 
cultures of primary airway epithelial cells can be passaged up to 4 times during 
20 this analysis. In contrast, polarized epithelial monolayers are evaluated at 1 
week intervals for DNA persistence for up to 6 weeks. Since GFP transgene 
expression may be low and difficult to detect by direct fluorescence, GFP is 
quantitated by fluorometer of cell lysates. 

Following AAV transduction, circular intermediates may form within 
25 cells and certain structures of these intermediates may persist by virtue of 

affinity for cellular factors which bind at ITR arrays. If this is true, then it may 
be possible to select for and isolate optimal circular intermediates with increased 
persistence in airway cells by batch screening of circular intermediates pools 
from rAAV infected airway epithelia. 
30 Primary airway epithelia cell cultures are infected with AV. GFP3ori 

(MOIs of 1000 to 1 0,000 DNA part/cell) and low molecular Hirt DNA is 
prepared at 5-15 days post-infection. Hirt DNA containing circular 
intermediates from rAAV infected cells is used to then transfect primary airway 
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epithelial cells from which Hirt DNA is prepared at 5-15 days post-transfection. 
This second Hirt isolation is then used to isolate replication competent plasmids 
following transformation into bacteria. This selection process may give rise to 
those populations of circular intermediates with increased episomal persistence 
5 in airway epithelial cells. Selected clones of circular intermediate plasmids 
isolated by this procedure are then tested individually for increased persistence 
following liposome mediated transfection. These studies are performed in a 
batch type screening in 24 well plates using two serial passages for persistence. 
Once plasmids having increased persistence are isolated, their structure and 
1 0 sequence of ITR arrays are characterized. Since screening is performed on 
small-scale cultures, it may be necessary to implement semi-quantitative 
screening for DNA persistence within the first round of transfection using PCR 
methods. Candidate plasmids with a high level of increased persistence as 
compared to control plasmids which lack ITR sequences but contain the identical 
15 promoter-reporter element, are evaluated on a larger scale transfection amenable 
to analysis by Southern blotting of total DNA. 

To evaluate selected circular intermediate structures in vivo, two models 
including mouse lung and the human bronchial xenograft are employed. 10 wk 
BalbC mice are transfected with GL-67/DNA complexes at a ratio of 25 \ig 
20 plasmid/25 \ig lipid in an iso-osmotic solution of Dextrose. At 1, 5, 10, 15, and 
20 days post-transfection lungs of mice are harvested for immunofluorescent 
detection of GFP in formalin fixed sections and for quantitative fluorometry of 
tissue lysates. Southern blots are employed to evaluate the persistence of 
plasmids in Hirt and genomic DNA. In addition to evaluating the persistence of 
25 selected circular intermediates which have the highest level of persistence with 
in vitro models, luciferase constructs are evaluated in which the ITR array has 
been cloned either 5' or 3' to the reporter gene. Furthermore, the use of 
luciferase reporters allows for more sensitive assessment of transgene activity in 
cell lysates. 

30 Similarly, in vivo persistence of transfected circular intermediates and 

heterologous plasmids containing ITR arrays found within circular intermediates 
is evaluated in human bronchial xenografts. 
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Findings evaluating the effects of adenoviral co-infection on circular 
intermediate formation and persistence have suggested that E2a DBP leads to a 
10-fold increase in the abundance of circular intermediates as compared to E2 
deleted virus. Furthermore, studies with El -deleted virus have demonstrated that 
5 the persistence of circular intermediates in Hela cells is increased at 72 hours 
post-infection. These studies suggest that E2a DBP may augment circular 
intermediate formation and/or increase the stability of these structures by an 
unknown mechanism. E2a DBP may interact directly with circularized genomes 
and/or induce cellular factors which interact with sequences in these AAV 

10 genomes. Since DBP encodes an NLS, this protein may act to shuttle circular 
intermediates to regions of nucleus that allow for increased stability of these 
structures. NLS sequences have been shown to cooperatively interact with 
nucleolar targeting sequences and hence we will also evaluate if subnuclear 
targeting is important in maintaining the increased stability of circular 

15 intermediates containing ITR arrays. Furthermore, it is currently unknown 
where circular intermediates form in the cell and it remains plausible that they 
may form in the cytoplasm or nucleus. Hence, if DBP associates directly with 
circular intermediates, it may act as an NLS for DNA to enter the nucleus as 
well. 

20 Several in vitro reconstitution models are used to investigate the 

interaction of circular intermediates with DBP and their affect on in vivo 
persistence following DNA transfection in Hela cells. Furthermore, results 
evaluating the affects of various mutant adenoviral vectors on circular 
intermediate and Rfm/Rfd formation have suggested that these two types of 

25 intermediates occur by independent pathways indicative of latent and lytic 

infection, respectively. In the setting of wild type AAV, circular intermediates 
may be pre-integration complexes, which in the presence of Rep, efficiently 
integrate into the host genome. In contrast, in the absence of Rep, circular 
intermediates may accumulate episomally in rAAV infected cells. To this end, 

30 methods of supplementing Rep function may be capable of enhancing integration 
of plasmid based delivery of AAV circular intermediates. Thus, experiments in 
which co-transfection of circular intermediate plasmids with Rep expression 
plasmids or mRNA are conducted. 
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To investigate whether DBP can augment the stability of circular 
intermediates by increasing targeting to the nucleus, a Hela cell line (gmDBP6) 
is utilized which encodes an inducible E2a gene under a dexamethasone 
responsible element. This cell line gives rise to high levels of DBP in nuclear 
5 extracts by Western blot following treatment with dexamethasone. gmDBP6 
cells (+/- DEX) are transfected with various AAV circular intermediate plasmids 
containing 0, 1, 2, and 3 ITRs and total cellular and nuclear plasmid content 
evaluated by subcellular fractionation using Southern blotting against GFP 
probes. The time course of these studies is initially within the range of 12 hours 
10 to 4 days post-transfection. Transgene expression is evaluated by fluorometry 
(in cell lysates), and fluorescent microscopy (in viable cells), for GFP and 
luminescence for luciferase. Hela cells have demonstrated that immediate 
increases in transgene expression from AAV GFP circular intermediates as 
compared to control GFP plasmids occur as early as 24 hours post-transfection. 
1 5 Thus, certain cellular factors may facilitate an immediate accumulation of 
circular intermediates in the nucleus. DBP may invoke this increase by either 
direct interactions with ITR sequences or by the induction of cellular factors. To 
evaluate the potential for direct interactions between DBP and circular 
intermediates, various form of ITR arrays found within circular intermediates are 
20 end-labeled with y-ATP 32 and evaluated for binding by electrophoretic mobility 
shift assays to nuclear extracts from gmDBP6 cells (+/- DEX). Supershifts, with 
DBP antibodies and competition experiments with cold ITR sequences and non- 
specific DNA, are used as controls for specific binding. 

In a second model system aimed at evaluating the potential of DBP for 
25 shuttling and/or sequestering of circular intermediates to the nucleus, 

microinjection experiments in oocytes are performed with 50 ng of plasmid 
DNA of circular intermediates with and without 50 ng of DBP mRNA. 
Experiments initially evaluate the time course of GFP transgene expression (+/- 
DBP cRNA) by direct fluorescent microscopy. If major differences are seen, 
30 quantitative fluorometry of individual whole oocytes in 96 well plates is 

conducted. Similar studies on nuclear targeting in the presence of DBP can also 
be evaluated in this model by pooling microinjected oocytes for nuclear isolation 
and Southern blot analysis. 
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A third experimental model to evaluate nuclear targeting and/or 
accumulation of circular intermediate vectors in the presence and absence of 
DBP involves the microinjection of fluorescently labeled plasmid DNA into the 
cytoplasm and real time imaging to follow the nuclear accumulation of DNA. 
5 The DNA fluorescent dye, TOTO-1, is used to label DNA prior to injection. 
This dye forms an extremely stable complex with negligible diffusion and re- 
incorporation into nuclear DNA following transfection into polarized airway 
epithelial cell monolayers. Co-localization of DBP with wtAAV DNA genomes 
at focal hot spots within the nucleus supports the observation that nucleolar 
10 targeting may be important for persistence. These experiments are also 

performed in primary airway epithelial cells and in vivo models of the airway by 
either co-transfection of circular intermediates with DBP expressing plasmids 
and/or mRNA. 

The effects of Rep co-transfection on the integration of circular 
15 intermediate plasmids is also evaluated. Two methods are used to express Rep 
including: 1) co-transfection with Rep expressing plasmids, and 2) co- 
transfection with Rep encoding mRNA. Initially, Hela, CFT1, and IB-3 cells are 
tested, as transformed cells may be more amenable to expansion and evaluation 
of integration. Both CFT1 and IB-3 cells represent airway epithelial cells. 
20 Experiments are performed by cationic liposome (GL-67) mediated transfection 
of circular intermediate DNA with varying doses of a Rep-containing expression 
vector, e.g., pCMVRep. The extent of integration is also evaluated by two 
criteria, Southern blotting of Hirt and genomic DNA and clonal expansion of 
GFP expressing cells. Since Southern blot has an approximate limit of 
25 sensitivity of 1 integrated plasmid molecule per 1 0 cellular genomes, clonal 
expansion may be necessary to evaluate persistence in less transferable cells 
such as CFT1 and IB-3 cells. Cell lines are evaluated over the course of 1-10 
passages. 

Sustained expression of Rep by plasmid mediated co-transfection may be 
30 toxic to cells, hence co-transfection with Rep mRNA is also evaluated. Cationic 
liposome:mRNA mediated transfection has been previously shown to work in 
cell lines and although the level of expression is much more transient than for 
DNA, in these studies it may be an advantage. Initial studies are performed with 
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in vitro transcribed .Rep mRNA alone to evaluate the jig amount of mRNA 
needed for Rep expression as determined by Western blot. Once the threshold 
for detectable Rep expression is established, increasing amounts of Rep mRNA 
are co-transfected with circular intermediate DNA. Similar assays are used as 
5 described above to evaluate the extent of AAV circular intermediate integration. 
If findings suggest that increased integration if facilitated by Rep, the site 
specificity of this integration can be evaluated by cloning GFP expressing cells 
after the 10th passage by serial dilution. These GFP expressing clones are 
expanded and genomic Southern blots assessed with both GFP and AVS1 

10 specific probes. By evaluating a number of restriction enzymes which either do 
not cut or cut once within the circular intermediate plasmid, it will be determined 
whether integration has occurred at the AVS1 loci. 

To test whether secondary structure rather than primary sequence is the 
important determinant of increased episomal stability of AAV circular 

15 intermediates, synthetic DNA sequences are generated with identical secondary 
structure to several ITR arrays in circular intermediates. The primary sequence 
is completely altered and bares no resemblance to sequences contained within 
native AAV ITRs. These synthetic DNA sequences are tested for their ability to 
confer increased episomal stability to heterologous plasmids in several model 

20 systems including: 1) the airway, 2) muscle, 3) and developing Xenopus 

embryos. The developing Xenopus embryo model is ideal for testing integration 
and persistence of plasmid based vectors for application of in utero gene therapy. 
If synthetic DNA sequences with similar secondary structure to ITRs are found 
to confer increased persistence to plasmid based vectors, then determinants for 

25 protein binding which facilitate persistence are independent of primary base 
sequence. These studies allow the optimization of the secondary structural 
requirements by synthesizing a wide range of DNA molecules with varying 
degrees of palindromic repeats. Furthermore, the secondary structure may not 
bind proteins directly but facilitate recombination of plasmids to large 

30 concatamers which have increased episomal stability or enhanced integration 
efficiencies. 
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Examples 

Delivery of Multinlff Genes thr ough rnt ermoieculflr Conr^m^y^ \ m 
Methods 

Recombinant AAV vectors 
5 Two rAAV vector stocks were generated for use in these studies, 

AV.GFP3ori (Example 1) and AV.Alkphos (also known as CWRAPSP, a gift of 
Dusty Miller) (Halbert et al., 1997). Virus stocks were generated by co- 
transfection of 293 cells with either pCisAV.GFP3ori or pCWRAPSP along with 
pRep/Cap, followed by co-infection with recombinant Ad.CMVlacZ helper virus 
10 (Example 2). rAAV was then purified through three rounds of CsCl density 
gradient centrifugation as previously described by Duan et al. (1997). Purified 
viral fractions were heated at 60°C for 1 hour to inactivate any residual 
contaminating helper adenovirus. The yields for AV.GFP3ori and AV.Alkphos 
were 1 x 10' 2 and 7 x 10" particles per ml, respectively, as determined by slot 
15 blot hybridization with 32 P-labeled GFP or Alkphos probes. Infectious titers 
determined by infection of 293 cells with rAAVs were 1.1 x 10 9 IU/ml 
(AV.GFP3ori) or 8.6 x 10 8 IU/ml (AV.Alkphos). Controls testing for 
contamination of rAAV stocks with wtAAV by anti-Rep immunocytochemical 
staining in rAAV/Ad.CMVlacZ co-infected 293 cells were negative (limit of 
20 sensitivity is less than 1 infectious wtAAV particle per 10 10 DNA particles of 
rAAV). Similarly, histochemical staining for p-galactosidase in rAAV infected 
293 cells showed no detectable contamination with helper adenovirus in 10 10 
DNA particles of rAAV (limit of sensitivity). 
Infection Of muscle tissue and evaluation ^ of transg fn p expression 
25 The C57BL/6 mice used for these experiments were housed in a virus- 

free animal care facility and were maintained under strict University of Iowa and 
NIH guidelines, using a protocol approved by the Animal Care and Use 
Committee and facility veterinarians. Four to five week old mice received 
bilateral 30 jd injections of a mixture of both AV.GFP3ori and AV.Alkphos into 
30 the tibialis anterior muscle (5 x 10 9 DNA particles of each virus per muscle). 
Controls included uninjected muscles and muscles receiving injections of one of 
the viruses alone. At 14, 35, 80, and 120 days post-infection, animals were 
euthanized and tissues were harvested for evaluation of transgene expression and 
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preparation of low molecular weight Hirt DNA. For each experimental time 
point, at least 3 independently injected muscles were evaluated. 

In all experiments, GFP fluorescence was visualized in freshly excised 
muscle tissue prior to processing. A portion of the same muscle was fixed with 
5 2% paraformaldehyde in phosphate buffered saline, and cryoprotected in graded 
sucrose solutions before embedding in optimal cutting temperature medium 
(OCT). Sections (6 ^m) were then evaluated for GFP expression directly and 
Alkphos expression following heat inactivation of endogenous Alkphos and 
histochemical staining for Alkphos activity (Engelhardt et al., 1995). To confirm 
10 dual localization of GFP and Alkphos expression in the same muscle fibers, 
either serial sections were evaluated for GFP and Alkphos expression or the 
same section was first photographed for GFP expression followed by 
histochemical staining for Alkphos and re-imaging of the same field. 
Res cue of circular intermediates from muscle Hirt DNA 
1 5 Low molecular weight Hirt DNA was prepared from 20 mg specimens of 

injected muscles from 3 animals at each time point (Example 2). Hirt DNA 
(4 1/5 of the total volume) was then used to transform 50 ^1 of 
electrocompetent SURE cells (Stratagene) using a BioRad E. coli electroporater 
and 0.1 jim cuvettes. Colonies resulting from each bacterial transformation were 
20 quantified, and plasmids from 20 colonies from each muscle Hirt DNA sample 
were purified for analysis. It should be noted that only circular forms carrying 
the Amp resistance gene and the bacterial origin of replication from AV.GFP3ori 
are rescued by bacterial transformation (Duan et al., 1 998). Control experiments 
reconstituting 5 x 10 10 viral DNA particles into uninfected muscle extracts prior 
25 to Hirt DNA preparation failed to give rise to replication competent plasmids in 
the rescue assay (Duan et al., 1998). Additional controls in Duan et al. (1998) 
using AV.GFP3ori virus also demonstrated that linear double stranded and single 
stranded purified viral DNA genomes do not give rise to replication competent 
plasmids following transformation into Ecoli. 
30 Characterization of encoded g enes in rescue d circular intermediates . 

Several assays were used to characterize the extent of intermolecular 
recombination between independent circular viral genomes by evaluating the 
number and type of encoded genes in rescued plasmids from Hirt DNA of 
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muscles co : infected with AV.GFP3ori and AV. Alkphos. Initial analysis 
involved the bulk evaluation of 60 rescued plasmids (20 from each of three 
muscle samples for each time point) by dot blot hybridization of mini-prep DNA 
with EGFP, Alkphos, and Amp 32 P-labeled DNA probes. In these studies, Amp 
5 hybridization served as a control to show that there was a sufficient quantity of 
DNA for the analysis. The percentages of Alkphos and/or GFP hybridizing 
plasmids were calculated by this method for each muscle sample. From this 
percentage, the total number of plasmids hybridizing to each probe in the Hirt 
DNA sample was calculated from the total CFU obtained in each transformation. 

10 In this analysis, each muscle sample was evaluated independently to determine 
the mean (+/-SEM) total Alkphos and/or GFP hybridizing plasmids. A second 
evaluation involved the transfection of rescued plasmids into 293 cells using 
lipofectamine, followed by evaluation of GFP fluorescence and histochemical 
staining for Alkphos. To confirm that GFP and Alkphos co-expressing plasmids 

1 5 were indeed clonal and that both genes were encoded on the same plasmid, a 
selected group of five co-expressing plasmids were retransformed into E.coli and 
colonies were re-isolated prior to repeating the transfection studies. In all cases, 
plasmids co-expressing the two reporter genes remained clonal through this 
subsequent re-isolation. 

20 Structural analysis of concatarner r A A V circular intermediates. 

To further characterize the nature of isolated circular intermediates co- 
expressing both GFP and Alkphos transgenes, plasmid structure was mapped by 
Southern blotting and restriction enzyme analysis. The structural of five co- 
expressing circular intermediate plasmids were determined by digestion with 

25 Ahdl, Hindffl, NotI, Hindlll/NotI, Clal/Asel, and/or SnaBI and Southern 
blotting was performed with 32 P-labeled GFP, Alkphos, and ITR probes. 
Results 

Strateg y for chara cterizin g mechanisms of rAAV circ ular intermediate 
formation . 

30 Efficient circularization of rAAV genomes has been previously 

demonstrated to occur in muscle in a time dependent fashion (Example 2). 
Furthermore, the conversion of monomelic to multimeric circular rAAV 
intermediates occurred over time and was associated with long-term episomal 
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persistence of AAV genomes. High molecular weight AAV circular genomes 
might form by either of the following two mechanisms, one involving the 
• replication of monomer structures and the other through intermolecular 

recombination between independent monomers. A rescue assay was developed 
5 using two separate rAAV vectors, AV.GFP3ori and AV.Alkphos (Figure 14A), 
which allowed for the identification of independent viral genomes through 
unique transgenes. In this assay, circular form genomes were rescued in bacteria 
by virtue of Amp/ori sequences encoded in one of the two vectors 
(AV.GFP3ori). A method for characterizing the extent of intermolecular 
1 0 recombination between independent circular rAAV genomes was shown in 
Figure 14B. 

Co-expression of independentl y encoded rAA V transg ene s in mu.d R my nfihers 

To confirm that myofibers can be co-infected at a high efficiency with 
the two rAAV vectors, the tibialis anterior muscle of mice was co-infected with 
15 5 x 10 9 DNA particles of both AV.GFP3ori and AV.Alkphos. At 14, 35, 80, and 
120 days post-infection, muscles were harvested and analyzed for transgene 
expression. Transgene expression from both reporters was weak but clearly 
visible in 14 day muscle samples. By 80 days post-infection, transgene 
expression was maximal and serial sections demonstrated expression of both 

20 Alkphos and GFP transgenes in overlapping regions of the muscle (Figures 1 5 A- 
C). At this time point, approximately 50% of the fibers in the tibialis muscle 
expressed both transgenes. To confirm that co-infection of myofibers occurred 
with the two independent vectors, co-localization studies were performed on 
muscle sections by a serial staining procedure. These studies, depicted in Figure 

25 1 5D, demonstrate four classes of myofiber transgene expression: 1) GFP positive 
only, 2) Alkphos positive only, 3) GFP/Alkphos positive, and 4) no transgene 
expression. The largest fraction of myofibers expressed both GFP and Alkphos 
transgenes. These results confirm that at the titers of virus used for infection, co- 
infection occurred in greater than 90% of transgene expressing myofibers. 

30 Rescue o f hi-ftmctional rAAV circular intermediates increases over time 
To determine the extent of recombination between circular AAV 
genomes, circular form genomes were rescued as plasmids from low molecular 
weight Hirt DNA of muscle tissue co-infected with AV.GFP3ori and 
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AV.AIkphos. Following transformation otE.coli Sure cells with Hirt DNA 
purified from infected muscles, the total number of GFP and Alkphos 
hybridizing Amp resistant bacterial plasmids was quantitated for each time point 
post-infection (Figure 16A and B) (Duan et al., 1995), the abundance of circular 
5 AAV genomes rescued from AV.GFP3ori increased over time. For each muscle 
sample (three for each time point) twenty plasmid clones were evaluated for 
hybridization to GFP and Alkphos DNA probes and the total number of plasmids 
was back calculated from the total CFU for each individual muscle sample. 
Figure 16B demonstrates the mean (+/-SEM, N=3) total plasmids that hybridized 
10 to GFP or GFP/Alkphos probes at each time point. At 14 days post-infection, 
GFP/Alkphos co-hybridizing plasmids were never observed. In contrast, at time 
points after 35 days the percentage of GFP/Alkphos co-hybridizing plasmids 
increased with time and reached 33% by 120 days (Figure 16C). Since bacterial 
plasmid rescue can only occur through AV.GFP3ori genomes, this data suggests 
1 5 that recombination between independent Alkphos and GFP rAAV genomes takes 
place over time. These results are consistent with studies described hereinabove 
demonstrating a time dependent concatamerization of monomer circular rAAV 
genomes in muscle. 

To evaluate the ability of circular intermediates to express encoded 
20 transgenes, transient transfection studies were performed in 293 cells with 
rescued circular intermediate plasmids (Figures 17A-C). Between 85-90% of 
rescued plasmids hybridizing to GFP probes on slot blots also expressed the GFP 
transgene in this transfection assay (Figure 17D). The percentage of GFP 
expressing plasmids that also expressed Alkphos rose over time in concordance 
25 with the hybridization data (Figure 17D). However, approximately 40-50% of 
plasmids which were hybridization positive for Alkphos did not express the 
Alkphos transgene. This may represent recombinational deletion of the RSV 
promoter driving Alkphos expression which occurred during concatamerization 
at sites near the 5' ITR. These results demonstrate that intermolecular 
30 recombination between Alkphos and GFP derived circular intermediates occurs 
as part of the time dependent concatamerization process of rAAV in muscle. To 
confirm that amplified plasmids stocks expressing both reporter genes were 
actually clonal (i.e., one plasmid rather than two independent plasmids resulting 
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from contamination), a select number of bacterial clones expressing both 
transgenes were re-isolated and the transfection assays were repeated. In all 
cases, plasmids expressing the two reporter genes remained clonal through two 
rounds of bacterial cloning. Hence, dual reporter expression was not due to 
5 contamination of independent GFP and Alkphos expressing plasmids. 

Hnnratamerization of AAV circular intermediates occurs through uniform 
iritftrmo1e.ciilar recombination betwe en TTRs of independent viral genomes. 

To better understand the mechanisms of circular concatamer formation, a 
detailed structural analysis was performed of five bi-fiinctional circular 

10 concatamers isolated from rAAV infected muscle samples. As previously 
described for the AV.GFP3ori genome (Example 2), the conversion of 
monomelic circular AAV genomes to large multimeric circular concatamers with 
a predominant head-to-tail structure increased with time in muscle. To evaluate 
the structure of bi-functional circular concatamers, restriction enzyme mapping 

15 and Southern blot analysis using 32 P-labeled EGFP, Alkphos, and ITR probes 
was employed. Results from five analyzed plasmids demonstrated between 3-6 
genomes within these circular concatamers. Two representative structures from 
35 and 80 day time points are shown in Figure 18. Several interesting 
conclusions can be made from this structural analysis. As described , head-to- 

20 tail oriented genomes could be seen in all isolated concatamers. However, 
several examples of head-to-head and tail-to-tail genome combinations of 
AV.Alkphos and AV.GFP3ori were also seen. Since head-to-head and tail-to- 
tail genome concatamers were never seen in muscles infected with AV.GFP3ori 
alone, there must be a selective disadvantage for bacterial replication when ori 

25 sequences are in either of these conformations. However, since the AV.Alkphos 
genomes do not contain a bacterial origin of replication, this orientation is 
permitted in chimeric concatamers. Second, noticeable deletions and/or loss of 
restriction sites close to ITRs were noted (Figure 17). It is not known whether 
deletions close to the ITR are a common event in the concatamerization process, 

30 but if so, this could account for the fact that only 60% of GFP/Alkphos 
hybridizing circular intermediates also expressed the Alkphos transgene. 
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Discussion 

Concatamerization of rAAV genomes has long been recognized in 
integrated proviral genomes. Recently, the association of this concatamerization 
process with the formation of high molecular circular genomes in muscle has 
5 suggested that this process may also be important in episomal persistence. The 
findings described herein demonstrated rescue of independent viral genomes 
within the same circular concatamer, suggesting that this process of 
concatamerization occurs through intermolecular recombination. Furthermore, 
at 14 days the predominant form of viral genome in muscle was circular 
10 monomers (Example 2), which correlates with the results described above 

demonstrating only GFP expression in rescued circular intermediates at this time 
point. Together with the fact that bi-functional rescued circular concatamers 
increase with time, these results suggest that large concatamers form by 
recombination of monomeric circular precursor genomes. Furthermore, since an 
15 alternative model of concatamerization by rolling circular replication would be 
expected to yield only GFP expressing rescued plasmids in this system, this 
mechanism does not appear responsible for concatamerization. 

Based on the structural analysis of these bi-functional circular 
intermediates, recombination between monomeric circular rAAV genomes is 
20 likely facilitated through ITR sequences. Directionality of this recombinational 
event does not appear to play a significant role, since head-to-tail, head-to-head, 
and tail-to-tail oriented intermolecular concatamers were found. In addition, the 
extent to which recombination within ITR repeat regions occurs in bacteria is 
presently unknown and may account for the deletions and/or restriction site 
25 losses near ITR arrays. However, serial passaging of bi-functional circular AAV 
genomes in bacteria has suggested that the structure of these large concatamers is 
impressively stable in bacteria. 

Intermolecular recombination of rAAV genomes to form single circular 
episomes may be particularly useful for gene therapy. For example, large 
30 regulatory elements and genes beyond the packaging capacity of rAAV may 
become linked after co-infecting tissue with two independent vectors 
(Figure 19). This strategy could also involve trans-splicing vectors encoding two 
independent regions of a gene which are brought together to form an intact 
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Example 6 

Enhancement of Recombinant AAV Mediated Gene Expression 
5 Through Intermoleml qr Ci.r-ar.tivatinn 

Materials and Methods 
Recombinant AAV vectors 

The pcisAV.Luc proviral plasmid was generated by cloning the 1983 bp 
Nhel/BamHI fragment from pGL3-Basic (Promega), containing the luciferase 
1 0 gene and S V40 late poIyA signal, by blunt-end ligation into the blunted Xba I 
site of pSub201 (Samulski et al., 1987). Similarly, pcisAV.SV(P)Luc was 
generated using a blunted 2175 bp Nhel/BamHI fragment, from the pGL3- 
Promoter (Promega), containing the SV40 promoter, luciferase gene, and SV40 
late polyA signal. The pcisAV.SV(P/E)Luc plasmid was generated by blunt- 
15 end ligation of a 2427 bp Nhel/Sall fragment from pGL3-Control (Promega) 
into the blunted Xba I site of psub201 . This construct contains the SV40 
promoter, luciferase gene, S V40 late polyA signal and SV40 enhancer. 

The "super-enhancer" vector, pcisAV.SupEnh, was produced using a 
two-step cloning process. First, a 0.62 kb blunted BglH/PvuI fragment 
20 containing the CMV immediate early enhancer from pIRES (Clontech) was 
subcloned into the blunted BamHI site in pGL3-Control (Promega) to make 
pGL3-Control-CMVenh. Then a 0.92 kb DNA segment containing both the 
CMV immediate early enhancer and the S V40 enhancer was released by 
Clal/Sall double digestion of pGL3-Control-CMVenh and subsequently inserted 
25 into the blunted PstI site of pcisAV.GFP3ori (Duan et al., 1998b). The resulting 
pcisAV.SupEnh plasmid contains the SV40 enhancer, the CMV immediate early 
enhancer, the P-lactamase gene, and a bacterial replication origin. The 
ampicillin resistance gene (P-lactamase) and bacterial original of replication 
were included in pcisAV.SupEnh to facilitate the subsequent rescue of circular 
30 AAV genomes from infected cells in bacteria. 

The control vector, pcisAV.AmpOri, was generated by blunt-end ligation 
of a 1 .1 kb Sail digested stuffer sequence from the human glycosylasparaginase 
cDNA into PstI digested pcisAV.GFP3ori. This plasmid has a structure similar 
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to that of pcisAV.SupEnh, except that it does not contain any enhancer elements. 
The pcisAV.AmpOri was used as a negative control for non-specific 
enhancement of transgene expression by intermolecular recombination of two 
different AAV vectors. 
5 The integrity of the ITR sequences in all the plasmids was confirmed by 

digestion with restriction enzymes, including Smal, MscI, and BssHII, which 
have unique cutting sites within different regions of ITR. All the viral 
preparations were obtained according to a method described in Duan et al. 
(1997). The quality of the viral stocks (i.e., contamination with adenovirus 
10 and/or wild type AAV) was confiimed as previously described in Duan et al. 
(1998b). The analyses showed less than 1 recombinant adenovirus and wt AAV 
infectious particles per 10 !0 particles of rAAV. Viral titers were determined by 
quantitative slot-blot hybridization using either luciferase, CMV enhancer, SV40 
enhancer, or ori probes for each of the respective vectors against plasmid copy 
1 5 number standards. 

Luciferase enzyme assays. 

Luciferase assays were performed from cell lysates harvested from either 
in vi'/ro-infected human fibroblasts or from in v/vo-infected mouse tibialis 
anterior muscle. Human fibroblasts were infected with virus in 60 mm dishes. 
20 These in v/'/ro-infected cells were harvested at 3 days post-infection by rinsing 
cells with PBS twice, and then incubating with lx Report lysis buffer (Promega) 
(400 jil per 60 mm plate) at room temperature for 15 minutes. Cells were 
scraped into an eppendorf tube and centrifuged for 30 seconds at 14,000 rpm. 
Serial dilutions of supernatants were assayed for luciferase activity using 
25 reagents from Promega according to the manufacturer's protocols. Luciferase 
activity was detected in triplicate for each individual sample with a Luminoraeter 
(TD-20/20, Turner Designs Instrument, Sunnyvale, CA), at the setting of 90.3% 
sensitivity, 1 second delay, 10 second measurement. Six independent samples 
were assayed for each experimental condition. 
30 For in vivo assay of luciferase activity, the anterior tibialis muscle of 8 

week old C57BL/6 mice was infected with an indicated amount and type of. 
AAV vectors (see Figure 23) in 30 \i\ phosphate-buffered saline (PBS). The 
entire muscle was harvested at 30 days or 90 days post-infection and weighed 
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prior to cell lysate preparation. The muscle tissue was frozen in liquid nitrogen 
and pulverized by hand grinding with an ice-cold porcelain mortar and pestle. 
The muscle was further minced and homogenized in 100 nl of lx Report lysis 
buffer with a hand-held plastic pestle for 2 minutes (Kontes, Vineland, New 
5 Jersey). After 1 5 minutes incubation at room temperature, the crude lysates were 
spun for 30 seconds at 14,000 rpm, and the supernatants were used for luciferase 
activity assay as described above. To minimize variability, all experimental 
samples were analyzed simultaneously using the same batch of luciferase assay 
reagents and were normalized to the protein content in the lysate. 
10 Results 

Co- a dministration of a c/.v-activating vector incr e ases rA AV mediated luciferase 

expression in fibroblasts. 

To test the hypothesis that c/s-activation from two independent AAV 
vectors can occur following intermolecular concatamerization, several rAAV 

15 vectors with defined regulator elements and/or the luciferase reporter gene were 
constructed (Figure 20). One of these vectors, AV.Luc, contains the luciferase 
transgene and an SV40 poly A signal but no promoter sequences. AV.SV(P)Luc 
is similar to AV.Luc except that an SV40 promoter (lacking the enhancer 
sequence) was inserted in front of the luciferase transgene. AV.SV(P/E)Luc 

20 includes both the SV40 promoter and enhancer, driving expression of the 
luciferase transgene, and was used as a control for maximal expression in the 
absence of intermolecular recombination with an enhancer containing vector. To 
evaluate intermolecular cw-activation by enhancer elements, an rAAV "super- 
enhancer" vector (AV.SupEnh) was also constructed, which contains SV40 and 

25 CMV enhancer regions without promoter or transgene sequences. A negative 
control vector (AV.AmpOri), which is similar to AV.SupEnh except that the 
enhancer sequences were replaced by a non-specific stuffer fragment, was also 
constructed. 

Enhancers are m-acting DNA sequences that can be recognized by 
30 regulatory proteins to stimulate transcription in a context independent manner 
relative to the promoter and transgene. If cells were co-infected with 
AV.SV(P)Luc and AV.SupEnh vectors, luciferase transgene expression could be 
significantly increased from the minimal SV40 promoter only if intermolecular 
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recombination had occurred between the two independent vectors (Figure 21). 
However, in accordance with the definition of an enhancer, no activation should 
occur if the enhancer sequences and the transgene cassette are located in separate 
circular DNA molecules (Lewin, 1997). 
5 Initial experiments were performed by infecting 1 x 10 6 primary human 

fibroblasts with single vectors [AV.Luc or AV.SV(P)Luc] at a multiplicity of 
infection (moi) equal to 1000 viral particles/cell. Additional experimental 
points included co-infection of AV.Luc or AV.SV(P)Luc with either 
AV.SupEnh or AV. AmpOri at the same moi. 
10 As shown in Figure 22, infection of fibroblasts with both the promoter- 

less [AV.Luc] and the minimal promoter [AV.SV(P)Luc] luciferase constructs 
alone gave only minimal expression at 3 days post-infection. However, co- 
infection with AV.SupEnh produced 16- and 35-fold inductions in luciferase 
expression from the AV.Luc and AV.SV(P)Luc vectors, respectively. Thus, cis- 
1 5 activation of a minimal S V40 promoter can occur in the presence of a second 
AAV vector containing enhancer elements. Unanticipated, however, was the 
high level of transactivation of the AV.Luc construct, which contains no 
heterologous promoter sequences. These findings support earlier studies 
suggesting that ITRs contain a cryptic promoter (Flotte et al., 1993). The 
20 specificity of this induction was further demonstrated by the lack of 

transactivation following co-administration of an alternative control vector, 
AV. AmpOri, which does not contain enhancer elements. 

To confirm that the transactivation observed was due to recombination of 
two independent AAV virus, Hirt DNA from infected cells was transformed into 
25 competent SURE E. Coli cells. As expected, no bacterial clones were retrieved 
from cells infected with either AV.Luc or AV.S V(P)Luc alone (neither vector 
contains amp r and ori sequences). However, in cells co-infected with 
AV.SupEnh (which contains amp r and ori sequences but no luciferase gene), 
approximately 4% of the rescued clones also contained the luciferase transgene, 
30 according to restriction enzyme mapping and Southern blotting analyses. 

Subsequent transfection of Hela cells with these rescued, circular concatamer 
plasmids demonstrated 103+/-6 fold higher luciferase activity from 
AV.Luc/AV.SupEnh, as compared to AV.Luc/AV.AmpOri recombined AAV 
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genome plasmids. Taken together, these results indicated that intermolecular 
concatamerization of a reporter rAAV virus with an independent "super- 
enhancer" AAV vector substantially increased the efficiency of transgene 
expression in cultured human primary fibroblasts. 
5 Intermolecular m-activation enhances AAV me diated luciferase expression in 
muscle tissue in vivn 

To confirm whether the in vitro findings could also be applied to increase 
rAAV mediated gene expression in vivo, 2 x 10 10 particles of A V.Luc or 
AV.SV(P)Luc were injected into the tibialis anterior muscle of C57BL/6 mice, 
1 0 either individually or in combination with AV.SupEnh (2 x 1 0 10 particles). The 
infected muscles were harvested at 30 and 90 days post-infection. Consistent 
with previous findings (Example 5), intermolecular recombination of circular 
rAAV genomes increased from about 5% at the 30 day time point to 25% by 90 
days post-infection. Importantly, co-administration of AV.SupEnh vector with 
1 5 either AV.Luc or AV.S V(P)Luc (Figure 23 A) led to a functional enhancement of 
transgene gene expression, with a 30-fold increase observed at 30 days post- 
infection. By 90 days post-infection, greater than 200-fold [AV.Luc] and 600- 
fold [AV.SV(P)Luc] increases in transgene expression were observed when 
AV.SupEnh was co-administered. 
20 Consistent with in vitro experiments in fibroblasts, the bacterially 

rescued concatamers containing both the luciferase transgene and AV.SupEnh 
also demonstrated greater than a 100-fold higher luciferase activity than the 
original pro viral luciferase plasmid (pcisAV.Luc) alone. Interestingly, 90 day 
muscles infected with 2 x 10 10 particles of the control AV.SV(P/E)Luc vector 
25 (which contains both the SV40 enhancer and promoter) produced luciferase 

levels that were 10- and 100-fold less than the levels seen following co-infection 
with AV.SupEnh/AV.Luc and AV.SupEnh/AV.S V(P)Luc, respectively. These 
findings underscore the potential of using large multi-enhancer segments to 
increase AAV mediated gene delivery through intermolecular cw-activation. 
30 Such applications will likely have broad implications for in vivo AAV mediated 
gene delivery to organs for which concatamerization of the AAV genome is an 
inherent process of its latent life cycle. 
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Both in vitro and in vivo results described herein demonstrated effective 
augmentation of transgene expression following intermolecular 
concatamerization between two independent AAV vectors carrying a transgene 
or enhancer sequences. A surprising finding was the strong induction of 
5 transgene expression from an AAV vector lacking an endogenous promoter 
(A V.Luc). Previous studies have implied weak promoter activity in the AAV 
ITR sequences (Flotte et ah, 1993). Hence, it is possible that, in combination 
with enhancer sequences supplied by another AAV vector, a therapeutic level of 
transgene expression could be achieved for disease genes approaching the 
10 maximum packaging capacity for AAV. One notable example would be in 
applications of AAV for cystic fibrosis. Additionally, a particular interesting 
extension of this work will be to use large cell-specific enhancer regions for 
targeted expression. 

15 Example 7 

Generation of rAAV Deliv e ry S y stems for Full-length CFTR anH fpq 
Based on AAV Circular Concatamer Formation 
Given the findings that circularization and concatamerization are integral 
parts of rAAV transduction in the airway, rAAV transduction, circularization 
20 and concatamerization can be used to deliver CFTR transgene cassettes to the 
airway. Two approaches may be employed in which various genetic elements of 
the CFTR transgene cassette are split into two or more vectors which are then 
used for co-infection of the airway. The first approach, "/raws-splicing", utilizes 
a rAAV vector harboring the promoter/enhancer driving the first half of CFTR 
25 DNA flanked by a donor splice site (vector- 1, Figure 19C) and a second rAAV 
vector harboring the second half of CFTR DNA and a polyA addition sequence 
preceded by a donor acceptor site (vector-2, Figure 19C). A second approach, 
"as-activation", employs a first vector that harbors the entire CFTR transgene 
with a minimal synthetic promoter and a second vector comprises several strong 
30 enhancer sequences. Through the process of concatamerization, these two 

vectors are brought into juxtaposition with one another, allowing for splicing or 
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ris-activation of enhancer/promoter combinations. 

Initial experiments assess the extent of concatamerization in epithelia and 
in cell lines. Two rAAV CFTR plasmid-based vectors were prepared, 
pAV.RVSCFTR donor and pAV. CFTR acceptor. The splice consensuses 
5 sequences are based on the large T antigen splice sites. A synthetically 
generated circular chimera was also constructed to represent rAAV circular 
concatamers. The chimera was generated by PCR-mediated approaches in which 
splice site consensuses were incorporated into the primers used to amplify each 
half of CFTR from pBQCFTR (Drumm et al., 1990). Several clones were 
1 0 sequenced in. their entirety and PCR errors removed by subcloning the 
appropriate correct fragment from pBQCFTR. Assays for splicing 

integrity include RNase protection assays of transfected MDCK cells for 
sequences within the large T antigen intron and surrounding regions. Polarized 
airway epithelial cells grown at the air-liquid interface are co-infected with the 
1 5 donor and acceptor CFTR AAV vectors and CFTR gene expression in these cells 
is then monitored by both immunofluorescent localization and functional 
analysis of short circuit currents (Smith et al. 3 1992; Smith et al., 1990). 
Additionally, functional assays using two electrode voltage clamp measurements 
(TEV) of oocytes following nuclear microinjection of the chimeric plasmid are 
20 used to demonstrate intact splicing and active CFTR channels (Zhang et al., 

1998). Hirt analyses of episomai AAV species are used to correlate the efficacy 
and persistence of CFTR gene expression with the formation of AAV circular 
intermediates. 

Several systems are utilized to evaluate the efficiency of intermolecular 
25 recombination to form functional CFTR expressing circular concatamers. The 
first employs short circuit measurements in polarized CF airway epithelia. 
Conditions for rAAV infection include basolateral infection of polarized CF 
epithelia, preferably in the presence of agents that enhance the level of transgene 
expression. A second model employs CF human bronchial xenografts to 
30 evaluate CFTR function complementation. Reconstitution of xenografts with 
UV-irradiated rAAV infected primary cells can resulted in approximately 50% 
transduction in differentiated xenografts.. Functional expression of CFTR in CF 
primary ainvays is evaluated by transepithelial potential difference (PD) 
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measurements (Jiang et al., 1998). 

In addition, a "m-activation" approach may be employed (Example 6) to 
utilize the concatamerization process to deliver full-length CFTR. For example, 
a synthetic minimal promoter driving full-length CFTR within one viral vector 
5 and a second independent vector encoding a tandem array of strong enhancer 
sequences such as RS V, CMV, and SV40 may be employed. 

A trans-splicing approach was employed to deliver the genomic epo gene 
to mice. The vectors are shown in Figure 25. The infection of mice with these 
vectors protected the mice from adenine induced anemia brought on by renal 
10 failure. 



Example fi 

Generation of rAAV Delivery Systems for Auton o mously Kpp lirating 
15 Enisomes Harhorinp F.RV Replfcarirm Origins. 

To prepare autonomously replicating circular episomes, e.g., to increase 
the stability of genes delivered via rAAV vectors, a rAAV vector comprising a 
replication origin of a circular episome is employed. For example, a rAAV 
vector comprising the EBV replication origin (OriP) and EBNA-1, the only viral 
20 protein needed to facilitate replication at this origin, is prepared (Vector 2, 
Figure 24). DNA fragments encoding EBNA-1 and OriP are excised from 
pREPIO (Clontech Inc.) and cloned into a pCisAV.RSV vector. Initially, virus 
is produced from pCisAV.EBNA-l/OriP construct and a reporter construct, e.g., 
AV.GFP3ori, and virus is used to co-infect Hela cells. Hela cells are infected 
25 with each of the vectors alone (Figure 24) or together (MOI = 1000 

particles/cell) and serially passaged. Transgene expression is quantitated 
following each passage and Hirt DNA is isolated for Southern blot analysis of 
episomal DNA persistence. If persistence of GFP expression is increased in the 
presence of co-infection with EBNA-1 rAAV vectors following serial passage, 
30 there will be an increased abundance of co-encoding EBNA-1 /GFP rescued 
plasmids with increasing passage number. 

Then primary human bronchial cells are infected with each of the vectors 
alone or together at an MOI - 10,000 particles/cell and infected cells seeded into 
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human bronchial xenografts. Persistence of GFP transgene expression is 
compared between infected xenografts at 1, 3, 5, and 8 weeks post- 
transplantation. Epithelia is also harvested for Hirt DNA analysis by perfusion 
of xenografts with SDS containing Hirt extraction buffer. Hirt DNA Southern 
5 blots using EBNA and GFP probes assess the extent of episomal persistence of 
monomer and concatamer circular genomes over time. 

An increase in the persistence of EBNA- 1 hybridizing Hirt DNA in cells 
infected with the EBNA-1 vector alone, as compared to GFP-probed Hirt 
Southerns of cells infected with only the GFP encoding rAAV construct, is 
10 observed as monomer circularized EBNA-1 vectors are the predominant form of 
autonomously replicating episomes. However, when cells are infected with both 
vectors simultaneously, GFP transgene expression is prolonged. Molecular 
analyses are performed to characterize the autonomously replicating circular 
chimeras encoding both GFP and EBNA-1. 
15 This approach may be employed for gene therapy, as circular 

concatamers with as many as five genomes, have been observed. For example, 
for CFTR gene therapy, human bronchial xenografts and the RSV-EBNA-1 
rAAV vector and a second CFTR rAAV vector with a minimal promoter may be 
employed. If genomes are oriented in a head-to-head fashion, the RSV enhancer 
20 may increase transcription from the minimal CFTR promoter. 
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certain of the details described herein may be varied considerably without 
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WHAT IS CLAIMED IS: 



1 . A composition comprising at least two recombinant adeno-associated 
viruses (AAV), comprising: 

a) a first recombinant AAV comprising a first recombinant DNA 
molecule comprising linked: 

i) a first DNA segment comprising a 5 '-inverted terminal 
repeat of AAV; 

ii) a second DNA segment which does not comprise AAV 
sequences; and 

iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV; and 

b) a second recombinant AAV comprising a second recombinant 
DNA molecule comprising linked: 

i) a first DNA segment comprising a S'-inverted terminal 
repeat of AAV; 

ii) a second DNA segment which does not comprise AAV 
sequences and which second DNA segment is different 
than the second DNA segment of the first recombinant 
DNA molecule; and 

iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV. 



The composition of claim 1 further comprising a delivery vehicle. 

The composition of claim 2 where the vehicle is a pharmaceutical^ 
acceptable carrier. 

The composition of claim 1 wherein the second DNA segment of the first 
recombinant DNA molecule comprises a portion of an open reading 
frame operably linked to a promoter. 
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5. The composition of claim 4 wherein the first recombinant DNA molecule 
comprises a splice donor site 3' to the open reading frame. 

6. The composition of claim 4 wherein the second DNA segment of the 
second recombinant DNA molecule comprises the remainder of the open 
reading frame which together with the second DNA segment of the first 
recombinant DNA molecule encodes a full-length polypeptide. 

7. The composition of claim 6 wherein the second DNA segment of the 
second recombinant DNA molecule comprises a splice acceptor site 5' to 
the remainder of the open reading frame. 

8. The composition of claim 1 wherein the second DNA segment of the first 
recombinant DNA molecule comprises an enhancer. 

9. The composition of claim 1 wherein the second DNA segment of the first 
recombinant DNA molecule comprises a promoter. 

1 0. The composition of claim 8 or 9 wherein the second DNA segment of the 
second recombinant DNA molecule comprises at least a portion of an 
open reading frame. 

1 1 . The composition of claim 1 0 wherein the second DNA segment of the 
second recombinant DNA molecule further comprises a promoter 
operably linked to the open reading frame. 

12. The composition of claim 1 wherein the second DNA segment of the first 
recombinant DNA molecule comprises an origin of replication 
functional in a host cell. 



1 3. The composition of claim 1 2 wherein the origin is a viral origin of 
replication. 
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14. The composition of claim 13 wherein the origin is functional in a human 
cell. 



15. The composition of claim 12 wherein the second DNA segment of the 
first recombinant DNA molecule further comprises DNA encoding a 
protein that binds to the origin of replication. 

16. The composition of claim 15 wherein the second DNA segment in the 
second recombinant DNA molecule comprises at least a portion of an 
open reading frame. 

1 7. The composition of claim 16 wherein the second DNA segment in the 
second recombinant DNA molecule further comprises a promoter 
operably linked to the open reading frame. 

18. A recombinant adeno-associated viral vector comprising a heterologous 
origin of replication. 

19. A recombinant adeno-associated viral vector comprising at least one 
heterologous transcriptional regulatory element functional in a host cell 
and which does not comprise sequences which encode a protein. 

20. The vector of claim 1 9 wherein the element is a promoter. 

2 1 . The vector of claim 1 9 wherein the element is an enhancer. 

22. A recombinant adeno-associated viral vector comprising a DNA 
sequence encoding a protein that binds to a heterologous origin of 
replication. 

23. A plasmid comprising the vector of claim 18, 19 or 22. 



24. A host cell contacted with the composition of claim 1 . 
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25. A host cell contacted with at least two recombinant AAV, 

wherein a first recombinant AAV comprises a first recombinant DNA 
molecule comprising linked: 

i) a first DNA segment comprising a 5 '-inverted terminal 
repeat of AAV; 

ii) a second DNA segment which does not comprise adeno- 
associated viral sequences; and 

iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV; and 

wherein a second recombinant AAV comprises a second recombinant 
DNA molecule comprising linked; 

i) a first DNA segment comprising a S'-inverted terminal 
repeat of AAV; 

ii) a second DNA segment which does not comprise adeno- 
associated viral sequences and which second DNA 
segment is different than the second DNA segment of the 
first recombinant DNA molecule; and 

iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV. 



26. A method to transfer recombinant DNAs to a host cell, comprising: 
contacting the host cell with at least two recombinant AAV, 
wherein a first recombinant AAV comprises a first recombinant DNA 
molecule comprising linked: 

i) a first DNA segment comprising a 5'-inverted terminal 
repeat of AAV; 

ii) a second DNA segment which does not comprise adeno- 
associated viral sequences; and 

iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV; and 

wherein a second recombinant AAV comprises a second recombinant 
DNA molecule comprising linked: 

i) a first DNA segment comprising a S'-inverted terminal 
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repeat of AAV; 

ii) a second DNA segment which does not comprise adeno- 
associated viral sequences and which second DNA 
segment is different than the second DNA segment of the 
first recombinant DNA molecule; and 

iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV 

27. A method to transfer and express a polypeptide in a host cell comprising 
contacting the host cell with the composition of claim 1. 

28. The method of claim 26 or 27 wherein the second DNA segment of the 
first recombinant DNA molecule comprises a portion of an open 
reading frame operably linked to a promoter. 

29. The method of claim 28 wherein the first recombinant DNA molecule 
comprises a splice donor site 3' to the open reading frame. 

30. The method of claim 29 wherein the second DNA segment of the 
second recombinant DNA molecule comprises the remainder of the open 
reading frame which together with the second DNA segment of the first 
recombinant DNA molecule encodes a full-length polypeptide. 

3 1 . The method of claim 30 wherein the second DNA segment of the 
second recombinant DNA molecule comprises a splice acceptor site 5' to 
the remainder of the open reading frame. 

32. The method of claim 26 or 27 wherein the second DNA segment of the 
first recombinant DNA molecule comprises an enhancer. 

33. The method of claim 26 or 27 wherein the second DNA segment of the 
first recombinant DNA molecule comprises a promoter. 
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34. The method of claim 32 wherein the second DNA segment of the 
second recombinant DNA molecule comprises at least a portion of an 
open reading frame. 

35. The method of claim 33 wherein the second DNA segment of the 
second recombinant DNA molecule comprises at least a portion 
of an open reading frame. 

36. The method of claim 34 wherein the second DNA segment of the 
second recombinant DNA molecule further comprises a promoter 
operably linked to the open reading frame. 

37. The method of claim 35 wherein the second DNA segment of the 
second recombinant DNA molecule further comprises a promoter 
operably linked to the open reading frame. 

38. The method of claim 26 or 27 wherein the second DNA segment of the 
first recombinant DNA molecule comprises an origin of 
replication functional in a host cell. 

39. The method of claim 38 wherein the origin is a viral origin of 
replication. 

40. The method of claim 39 wherein the origin is functional in a human 
cell. 



41 . The method of claim 26 or 27 wherein the second DNA segment of the 
first recombinant DNA molecule further comprises DNA encoding a 
protein that binds to the origin of replication. 

42. The method of claim 41 wherein the second DNA segment in the 
second recombinant DNA molecule comprises a portion of an open 
reading frame. 
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43. The method of claim 41 wherein the second DNA segment in the 
second recombinant DNA molecule further comprises a promoter 
operably linked to the open reading frame. 

44. A recombinant adeno-associated viral vector comprising a DNA 
sequence encoding a protein that binds to a heterologous origin of 
replication for use in medical therapy. 

45. A recombinant adeno-associated viral vector comprising a heterologous 
origin of replication for use in medical therapy. 
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A 

10 20 30 40 50 TO 

GCATGCAAGC TGTAGATAAG TAGCATGGCG GGTTAATCAT TAACTACAAG GAACCCCTAG 
CCTACGTTCG ACATCTATTC ATCGTACCGC CCAATTAGTA ATTGATGTTC CTTGGGGATC 

70 80 90 100 HO 120 

TGATGGAGTT GGCCACTCCC TCTCTGCGCG CTCGCTCGCT CACTGAGGCC GGGCGGCCAA 
ACTACCTCAA CCGGTGAGGG AGAGACGCGC GAGCGAGCGA GTGACTCOGG CCCGCCGGTT 

130 140 150 160 170 180 

AGGTCGCCCG ACGCCCGGGC TTTGCCCGGG CGGCCTCAGT GAGCGAGCGA GCGCGCAGAG 
TCCAGCGGGC TGCGGGCCCG AAACGGGCCC GCCGGAGTCA CTCGCTCGCT CGCGCGTCTC 

190 200 210 220 230 240 

AGGGAGTGGC CAACTCCATC ACTAGGGGTT CCTTGTAGTT AATGATTAAC CCGCCATCCT 
TCCCTCACCG GTTGAGGTAG TGATCCCCAA GGAACATCAA TTACTAATTG GGCGGTACGA 

250 260 270 280 

ACTTATCTAC CGATGAATTC GAGCTTGCAI GC 

TGAATAGATG GCTACTTAAG CTCGAACGTA Cfi 



B 

10 20 30 40 50 60 

GCATCCAAGC TGTAGATAAG TAGCATGGCG GGTTAATCAT TAACTACAAG GAACCCCTAG 
CGTACGTTCG ACATCTATTC ATCGTACCGC CCAATTAGTA ATTGATGTTC CTTGGGGATC 

70 80 90 100 HO 120 

TGATGGAGTT GGCCACTCCC TCTCTGCGCG CTCGCTCGCT CACTGAGGCC GGGCGCGOGC, 
ACTACCTCAA CCGGTGAGGG AGAGACGCGC GAGCGAGCGA GTGACTCCGG CCCGCGCGCG 

130 140 150 160 170 180 

TCGCTCGCTC ACTGAGGCCG GGCGACCAAA GGTCGCCCGA GCCCGGGCTT TGCCCGGGCG 
AGCGAGOGAG TGACTCCGGC CCGCTGGTTT CCAGCGGGCT CGGGCCCGAA ACGGGCCCGC 

190 200 210 220 230 240 

GCCTCAOTOA GCGAGCGCGC GCGCAGAGAG GGAGTGGCCA ACTCCATCAC TAGGGGTTCC 
CGGAGTCACT CGCTCGCGCG CGCGTCTCTC CCTCACCGGT TGAGGTAGTG ATCCCCAAGG 

* • 

250 260 270 280 . 290 300 

TTCTAGTTAA TGATTAACCC GCCATGCTAC TTATCTACCG ATGAATTCGA GCTTSCA2X5C 
AACATCAATT ACTAATTGGG CGGTACGATG AATAGATGGC TACTTAAGCT CGAACGIACS 
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CG&mCAAGC TGTAGATAAG TAGCATGGCG GGTTAATCAT TAACTACAAG GAACCCCTAG 
CGXACCTFCG ACATCTATTC ATCGTACCGC CCAATTAGTA ATTGATGTTC CTTGGGGATC 

70 80 90 100 110 120 

TGATCGAGTT GGCCACTCCC TCTCTGCGCG CTOGCTOGCT CACTGAGGCC GGGCGACCAA 
ACTACCTCAA CCGGTGAGGG AGAGACGCGC GAGCGAGCGA GTGACTCCGG CCCGCTCGTT 

130 140 150 160 170 180 

AGGTCGCCOG KCGCCCGGGC TTTGGTCGCC CGGCCTCAGT GAGCGAGCGA GCGCGCAGAG 
TCCAGCGGGC TCCGGGCOCG AAACCAGCGG GCCGGAGTCA CTCGCTCGCT CGCGCGTCTC 

190 200 210 220 230 240 

AGGGAGTGGC CAACTCCATC ACTAGGGGTT CCTTGTAGTT AATGAOTAAC CCGCCATGCT 
TCCCTCACCG GTTGAGGTAG TGATCCCCAA GGAACATCAA TTACTAATTG GGCGGTACGA 

250 260 270 280 

ACTTATCTAC CGATGAATTC GAGCTTGCAX CC 

TGAATAGATG GCTACTTAAG CTCGAACG3CA OS 
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10 * 20- 30 40 50 

P81 1 CCATfiGAAGC TCTAGATAAG TAGCATGGCG GGTTAATCAT TAACTACAAG 50 

p79 1 GG&IGCAAGC TCTAGATAAG TAGCATGGCG GGTTAATCAT TAACTACAAG 50 

pl202 1 GCATGCAAGC TCTAGATAAG TAGCATGGCG GGTTAATCAT TAACTACAAG 50 

60 70 80 90 100 

P81 51 GAACCCCTAG TGATGGAGTT GGCCACTCCC TCTCTGCGCG CTCGCTCGCT 100 

p79 51 GAACCCCTAG TGATGGAGTT GGCCACTCCC TCTCTGCGCG CTCGCTCGCT 100 

pi 2 02 51 GAACCCCTAG TGATGGAGTT GGCCACTCCC TCTCTGCGCG CTCGCTCGCT 100 

110 120 130 140 150 

P81 101 CACTGAGGCC GGGCG GCCAAAG 150 

p79 101 CACTGAGGCC GGGCGCGCGC TCGCTCGCTC ACTGAGGCCG GGCGaCcAAa 150 
pl202 101 CACTGAGGCC GGGCG— ACCAAAG 150 

160 170 , - 180 190 200 

P81 151 GTCGOCCGAC GCCCGGGCTT TGCCCGGGCG GCCTCAGTGA GCGAGCGAGC 200 
p79 151 GgtcgCCcga GCCCGGGCTT TGCCCGGGCG GCCTCAGTGA GCGAGCGcGC 200 
pl202 151 GTCGCCCGAC GCCCGGGCTT TGgtCGccCG GCCTCAGTGA GCGAGCGAGC 200 

210 220 230 240 250 

P81 201 GCGCAGAGAG GGAGTGGCCA ACTCCATCAC TAGGGGTTCC TTGTAGTTAA 250 
p79 201 GCGCAGAGAG GGAGTGGCCA ACTCCATCAC TAGGGGTTCC TTGTAGTTAA 250 
pl202 201 GCGCAGAGAG GGAGTGGCCA ACTCCATCAC TAGGGGTTCC TTGTAGTTAA 250 

260 270 280 290 300 

P81 251 TGATTAACCC GCCATGCT AC TTATCTACCG ATGAATTCGA GCTTGCATGC 300 
p79 251 TGATTAACCC GCCATGCTAC TTATCTACCG ATGAATTCGA GCTTGCATCC 300 
p!202 251 TGATTAACCC GCCATGCTAC TTATCTACCG ATGAATTCGA GCTTGCATGC 300 



WO 01/25465 PCT/US00/27863 

13/32 




14/32 



PC1YUS00/27863 




15/32 



PCI7US00/27863 



~T 



=2=o 



Sense 



Strand DNA Secondary Confirmation 



p5-D-A-B-B • -C-B'-A'-D'-pS 1 




WO 01/25465 PCT/USO0/27863 

16/32 




WO 01/25465 



17/32 



PCT/US00/27863 



rv 



B 



AV.GFP3ori p^ fCMVg 

(Obp) Snabl 
(657fap) 



GFP 



'Amp 



Ort 



NoU 
(1918bp) 



Asel 
(3642bp) 



(4708bp) 



Sphl Sphl cial 
OTB ^abpXWIbp) (2781 bp) 3m 

AVJMkphos |^ TrsvU| Alkphos(AP) 

(Obp) HindHI Snabl (3208bp) 

(781bp) (2941bp) 



Model 1 
Rolling Circular Replication 



AVJUkphos' 



Model 2 
Intermotecular Recombination 

fej!GFMaf3 ^ AVJUkphosll 



Co-infection 
Circularization 





Co-Infection 
Circularization 





Concatamerization 




Dimers and 
Multimers 



Rescue 



Plasmlds Express 
GFP only 



Concatamerization 




Dimers and 
Multimers 



Rescue^ 



Plasmlds Express 
GFP and GFP/Alkphos 



4 




WO 01/25465 



19/32 



PCT/US00/27863 



A BCD 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 



Probe: Alkphos 





ABCD 




ABCD 


1 


• 


# • • 


1 


• • • • 


2 


• 


• • • 


2 


• • • • 


3 


♦ 


• • • 


3 




4 




• • 


4 




5 


m 


# • • 


5 


# • # • 


6 


m 




6 


# • • • 


7 


m 


• • • 


7 




8 




• • • 


8 






9 


• 


• • • 


9 


# 




10 




• • • 


10 


• 






EGFP 






Amp 



14days(A1-A7) 
35days(Bl-BlO) 
80days(C1-C10) 
120 days (Dt--oiO) 

A8 (Amp control) 
A9 (GFP control) • 
A10 (Alkphos control) 



(0 

21600' 

E 

S 

g 1000 

=3 

o 

CO 
CD 

DC 

O 500 

<D - 

e 



B 



[ GFP(+VAP(+) 
|GFP(+) 



P 




14 35 80 

Days Post-Infection 




120 



25 50 75 100 125 

Days Post-lnfectlori 



WO 01/25465 



20/32 



PCTAJS00/27863 



NomarskJ GFP Alkphos 




Days PosHnfecction 



WO 01/25465 



21/32 



PCT/US00/27863 



A 



12WH 

3kbJ 
Skb- 
lkb-; 

0.5kb- 
Probe: 




EtBr 



EGFP 



Alkphos 



ITR 



B 



8naBI 



HMIU 
(18.1kb) 



(2aikb). 



(Okb) 



Asel 
(1.1kb) 



SnaBl 
(16161(b) 



Not! 
(15.4tt>) 



Acel 
(13.6kf>) 



8naBI 
(11^Wb) 



Nod 
(18 to) 



SnaBl 
(4.1kb) 




(10.7W>) (8.9 to) on 1^ 



Ace! 



HWIII 
(T.1to) 



*900bp deletion 



OH Intact 



5TTR 



3'ITR 



QPP . Alkphos 



WO 01/25465 



22/32 



PCT/US00/27863 



o 



i 



3 X 2 CO COX 



12kb- 
6kb- 

3kb- 
2kb- 

1kb- 

0.5kb- 



Probe: 



EtBr 



5 S« 



.= -'-» u-u.-— c — id to. L- t~~ 

X 2 CO COX ~5 X 2 CO COX 



I = _| 
aX2coco£ 





EGFP 



Alkphos 



D 



*Sph! 

deleted SnaBI 
<1Z4*b) «>) (0.27W)) 



Sphl 
(Z3kb) 



SnaBI 
<&58fcb) 



Sphl 
(B.1tt>) 




Kndlll 
(2.4fcb) 

Sphl 
(2.6M)) 



Sphl 
(3.39kb) 



Sphl 
(7.74Mb) 



SnaBI 
(3.e7kb) 



Nod 

(5,13W>) 



S'lTR 3 f (TR 



QPP . Alkphos 



WO 01/25465 



ITR Promoter 



Splice 
ITR Acceptor 



23/32 



Transgene (5* End) 



PCT/US00/27863 



Splice 
Donor ITR 



Transgene (3* End) Poly A ITR 



Vector 1 



Vector 2 



6 



Transgene 
(5«End) 




Transgene 
(3' End) 




WO 01/25465 PCT/USOO/27863 

24/32 




WO 01/25465 



25/32 



PCT/US00/27863 




< 

Ql 

O 

> 
CO 
*o 

CO 
CD 

© 
o 

3 





o 

ZJ 



> 
< 



o 
3 



> 

CO 

> 
< 



o 
ZJ 



LU 

CL 

> 
CO 

> 
< 



c 
LU 

CL 
ZJ 

CO 

> 
< 



O 

CL 

< 

> 
< 



FIG. 20 



WO 01/25465 



26/32 



PCTYUS00/27863 




FIG. 21 




FIG. 22 




FIG. 23 



WO 01/25465 



29/32 



PCT/US00/27863 




WO 0)725465 



30/32 



PCTYUSOO/27863 



AV 



Exonl Exon2 Exon3 



CMV 



AV.Epo2 



Promotef 
Exon4 ExonS EGFP 



5*Epo 



B 



3'Epo IRES SV40 

polyA 




Exont Exon2 Exon3 



hnRNA 



Splicing 
mRNA 



Exon4 ExonS 



IRES 



EGFP 



tntermolecular 
ITR Junction 



I AAA 



■AAA 



Mature 
mRNA 



EPO 



IRES EGFP 



I AM 



WO 01/25465 



31/32 



PCTAJS00/27863 



o 

<D 
O 



3 

o 

CL 



1000 
900 J 
800 
700 
600 
500 
400 H 
300 
200 -I 
100 
0 



■■AV.Epo1+AV.Epo2 

AV.Epol 
AV.Epo2 



T 



10 



Days Post-Infection 



WO 01/25465 



32/32 



PCT/US00/2->863 



62 
60 
* 58 



•c 

o 56 
o 

CO 

£ 54 H 



52 -J 
50 
48 
46 



• Non-infected (n=4) 

■ AV.Epol orAV.Epo2 (n=4) 
(each in an independent muscle) 

AV.Epol + AV.Epo2 (n-6) 




— , — 

63 



10 21 35 48 

Time (Days) 



80 



— i — 
94 



WO 01/25465 



PCTAJS00/27863 



SEQUENCE LISTING 

<110> University of Iowa Research Foundation 
5 Engelhardt, John F. 

Duan , Dongsheng 
Yan, Ziying 

<120> Adeno-associated virus vectors and uses thereof 

: 10 

<130> 875.024WO1 

<150> US 60/158,209 
<151> 1999-10-07 

15 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

20<210> 1 
<211> 20 
<212> DNA 

<213> Adeno-associated virus 

25<400> 1 

cgggggtcgt tgggcggtca 

<210> 2 
<211> 19 
30<212> DNA 

<213> Adeno-associated virus 

<400> 2 

gggcggagcc tatggaaaa 

35 

<210> 3 
<211> 505 
<212> DNA 

<213> Artificial Sequence 
40<220> 

<22 3> A consensus sequence. 
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<400> 3 



cgggggtcgt tgggcggtca gccaggcggg ccatttaccg taagttatgt aacgactgca 60 

ggcatgcaag ctcgaattca tcggtagata agtagcatgg cgggttaatc attaactaca 120 

aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcgctgagg 180 

Sccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 240 

gagcgcgcag ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg 3 00 

ggcgaccttt ggtcgcccgg cctcagcgag cgagcgagcg cgcagagagg gagtggccaa 360 

ctccatcact aggggttcct tgtagttaat gattaacccg ccatgctact tatctacagc 420 

ttgcatgcat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4Q0 

lOtggcgttttt ccataggctc cgccc 505 

<210> 4 
<211> 272 
<212> DNA 

15<213> AAV circular intermediate, clone p81 
<400> 4 

gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 60 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcggccaa 120 
20aggtcgcccg acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagag 180 
agggagtggc caactccatc actaggggtt ccttgtagtt aatgattaac ccgccatgct 240 
acttatctac cgatgaattc gagcttgcat gc 272 

<210> 5 
25<211> 300 
<212> DNA 

<213> AAV circular intermediate, clone p79 
<400> 5 

3 0gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 6 .0 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgcgcgc 120 
tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga gcccgggctt tgcccgggcg 180 
gcctcagtga gcgagcgcgc gcgcagagag ggagtggcca actccatcac taggggttcc 240 
ttgtagttaa tgattaaccc gccatgctac ttatctaccg atgaattcga gcttgcatgc 300 



35 

<210> 6 
<211> 272 
<212> DNA 

<213> AAV circular intermediate, clone pl202 
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<400> 6 






3 






gcatgcaagc 


tgtagataag 


tagcatggcg ggttaatcat 


taactacaag gaacccctag 


60 


tgatggagtt 


ggccactccc 


tctctgcgcg 


ctcgctcgct 


cactgaggcc gggcgaccaa 


120 


aggtcgcccg 


acgcccgggc 


tttggtcgcc 


cggcctcagt 


gagcgagcga gcgcgcagag 


180 


Sagggagtggc 


caactccatc 


actaggggtt 


ccttgtagtt 


aatgattaac ccgccatgct 


240 


acttatctac 


cgatgaattc 


gagcttgcat 


gc 




272 



<210> 7 
<211> 165 
10<212> DNA 
<213> Unknown 

<220> 

<223> SEQ ID NO:l of U.S. Patent No. 5,478,745 

15 

<400> 7 

aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60 
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120 
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcct 165 

20 

<210> 8 
<211> 282 
<212> DNA 

<213> rAAV circular intermediate, clone p79 

25 

<400> 8 



ggcgggccat ttaccgtaag ttatgtggcg -actgcaggca tgcaagctcg aattcatcgg 60 

tagataagta gcatggcggg ttaatcattg cctacaaaga gcccctagtg atggagtggg 120 

ccactccctc tcttcgccga gcgcgcagag agggagtggc caactccctc actaggggtt 180 

30cctggcagtt aatgattaac ccgccatgct acttatctac agcttgcatg catgtgagca 240 

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tg 2 82 



<210> 9 
<211> 345 
35<212> DNA 

<213> rAAV circular intermediate, clone p80 

<400> 9 

ggccatttac cgtaagttat gtaacgactg caggcatgca agctcgaatt catcggtaga 60 
4 0taagtagcat ggcgggttaa tcattaacta caaggaaccc ctagtgatgg agttggccac 120 
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tccctctctg cgcgctcgct cgctcgctca ggccgggcga ccaaaggtcg cccgacgccc 180 
gcccggcctc agcgagcgag cgagcgcgca gagagggagt ggccaactcc atcactaggg 24 0 

gttccttgta gttaatgatt aacccgccat gctacttatc tacagcttgc atgcatgtga 300 
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttg 345 

5 

<210> 10 
<211> 276 
<212> DNA 

<213> rAAV circular intermediate, clone p81 

10 

<400> 10 

ggccatttac cgtaagttat gtggcgactg caggcatgca agctcgaatt catcggtaga 60 
taagtagcat ggcgggttaa tcattgccta caaagagccc ctagtgatgg agcccggcct 120 
caccgagcga gcgagcgcgc agagagggag tggccaactc catcactagg ggttccttgt 180 
ISagttaatgat taacccgcca tgctacttat ctacagcttg catgcatgtg agcaaaaggc 240 
cagcaaaagg ccaggaaccg taaaaaggcc gcgttg 276 

<210> 11 

<21l> 316 

20<212> DNA 

<213> rAAV circular intermediate, clone p86 

<400> 11 

ggccatttac cgtaagttat gtaacgactg caggcatgca agctcgaatt catcggtaga 60 
25 taagtagcat ggcgggttaa tcattaacta caaggaaccc ctagtgatgg agttggccac 120 
tccctctctg cgcgctcgct cgctcgctga ggccgccccg gcctcagcga gcgagcgagc 180 
gcgcagagag ggactggcca actccatcac taggggttcc ttgtagttaa tgattaaccc 240 
gccatgctac ttatctacag cttgcatgca tgtgagcaaa aggccagcaa aaggccagga 300 
accgtaaaaa ggccgc 316 

30 

<210> 12 
<211> 208 
<212> DNA 

<213> rAAV circular intermediate, clone p87 

35 

<400> 12 

ggccatttac cgtaagttat gtaacgactg caggcatgca agctcgaatt catcggtaga 60 
taagtagcat ggcgggttac tcattgccta caaagagccc ctagtgatgg aattggaatg 120 
attcaccctc catgctactt atctacagct tgcatgcatg tgagcaaaag gccagcaaaa 180 
40ggccaggaac cgtaaaaagg ccgcgttg 208 
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<210> 13 
<211> 310 
<212> DNA 

<213> rAAV circular intermediate, clone p88 

5 

<400> 13 

gccatttacc gtaagttatg taacgactgc aggcatgcaa gctcgaattc atcggtagat 60 
aagtagcatg gcgggttaat cattgcctac aaagagcccc tagtgatgga gttggccact 120 
ccctctctgc gcgctcgctc gctgggcccg gcctcagcga gcgagcgagc gcgcagagag 180 
lOggagtggcca actccatcac taggggttcc ttgtagttaa tgattaaccc gccatgctac 240 
ttatctacag cttgcatgca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 300 
ggccgcgttg 310 
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